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REGISTRATION DATE 
DATE D • EHREGISTREKBJT 
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ESCAPE SEQUENCE 


SEQUENCE 

D'ECHAPPEMHIT 


Gq set / jeu Gq 


Gi ■ set / jeu G| 


MULTI BYTE SET 
JEU KULTIPLBT 


ESC 2/1 4/0 



hake/iiom 


The set of control characters of the ISO 646. 

Jeu de oaracteres de commande de la norme ISO 646. 


DESCRIPTION 

The set of 32 control characters as described in ISO 646. 

Jeu de 32 oaracteres de commande tels que ddcrits dans le norme ISO 646. 


SPONSOR/ ORGAN ISKE DE PARRAINAGE 


Secretariat ISO/TC Stf/SC 2 
Secretariat ISO/TC 97/SC 2 


RIGIN (US3R)/0RIGINS ( UTI LISATEUR) 


International Standard ISO 646. 
Norme Internationale ISO 646. 


lELD 0? UTILIZATION /DCHAINE D'AFPLICATIOK 


Coded information interchange. 
Echange d* inf ormation codee. 
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Control character 
(abbreviation) 

Column 
according 
to the 
„ type of 

Row 

CO (7 or 

8 bits) 

0 or 00 

1 or 01 

s 

8 

bits 

06 

09 

7 bits 
FINAL 

4 

5 

0 

NUL 

tc^dle) 

1 

TCj (SOH) 

DC 1 

2 

tc 2 (etx) 

dc 2 

3 

tc 3 (etx) 

dc 3 

4 

TC4(E0T) 

dc 4 

5 

tc 5 (enq) 

TCq(NAK) 

6 

tc 6 (ack) 

TCg(SYN) 

7 

BEL 

tc 10 (etb) 

8 

fE 0 (BS) 

CAN 

9 

FEj (HT) 

EM 

10 

fe 2 (lf)(*) 

SOB 

11 

fe 3 (vt)(*) 

ESC 

12 

fe 4 (ff)(*) 

is 4 (fs) 

13 

fe 5 (cr)(*) 

is 3 (gs) 

14 

so 

is 2 (rs) 

/ 

15 

SI 

is^us) 


Note : The format effectors are intended for equipments in which horizontal and vertical 
movements are effected separately. If equipment requires the action of CARRIAGE RETURN to 
he combined with a vertical movement, the format effector for that vertical movement may 
be used to effect the combined movement. For example, if NErf LINE (CR+LP) is required, 

FE2 shall be used to represent it. This substitution requires agreement between the sender 
and the recipient of the data. The use of these combined functions may be restricted for 
international transmission on general switched telecommunication networks (telegraph and 
telephone networks). 


4.5 
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Position 


List of nemonios and meanings 


Abbreviati 


Definition 


A control chapter used to acomplish media- 
fill or time-fill. Null characters may be 
inserted into or removed from a stream of 
data without affecting the information content 
of that stream. But then the addition or remo- 
val of there characters may affect the informa- 
tion lay out and/or the control of equipment . 


Transmission con- 
trol character 1 . 
(Start of heading 

Transmission con- 
trol character 2 
(start of text) 

Transmission con- 
trol character 3 
(End of text) 


TC, (SOH) 


tc 2 (stx) 


TC^ETX) 


A transmission control character used as the 
first character of a heading of an information 
message. 

A transmission control character which prece- 
des a text and which is used to terminate a 
heading. 


A transmission control character which 
terminates a text. 


Transmission con- 
trol character 4 TC, (EOT) 

(End.of transmis 4 
sion) 


A transmission control character used t o in- 
dicate the conclusion of the transmission of 
one or more texts. 


Transmission con- 
trol character 5 jC (ENQ) 
(Enquiry) ® 


Transmission con- . 
trol character 6 TCg (ACK) 
(Acknowledge) 


Format effector 0 __ t ac .\ 
(Backspace) PE ° (BS) 


A transmission control character used as a 
request for a response from a remote station; 
the response may include station indentifica- 
tion and/or station status. When a "Who are 
you" function is required on the general 
switched transmission network, the first use 
of ENQ after the connection is established 
shall have the meaning "Who are you" (station 
identification) . Subseouent use of ENQ may, or 
may-not, include the function "Who are you", 
as determined by agreement. 

A -transmission control character transmitted 
by a receiver as an affirmative response to 
the sender. 

A control character that is used when there 
is a need to call for attention ; it may 
control alarm or attention devices. 

A format effector which moves the active 
position one character position backwards on 
the same line . 


Format effector 1 
(Horizontal tabu- j>g (jyj) 
lation) 


A format effector which advances the active 
position to the next pre-determined 
character position on the same line. 
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List of nemonics'and meanings 

Position 

Name 

Abbreviation 

Definition 

0/10 

Format effector 2 
(Line feed) 


A format effector which advances the active 
position to the same character position of 
the nextline. 

0/11 

Format effector 3 
(Vertical tabula- 
tion) 


A format effector which advances the active 
position to the same character position on 
the next pre-determined line. 

0/12 

Format effector 
(Form feed) 


A format effector which advances the active 
position to the same character position on 
a pre-determined line of the next form or 
page. 

0/13 

Format effector 
(Carriage return) 

fe 5 (cr) 

A format effector which moves the active po- 
sition to the first character position on 
the same line. 

0/14 

Shift out 

SO 

A control character which is used in conjunc- 
tion with SHIFT-IN and ESCAPE to extend ’the 
graphic character set of the code. It may alta- 
the meaning of the bit combinations of columns 

2 to 7 which follow it until a SHIFT-IN cha- 
racter is reached. However, the characters 

SPACE (2/0) and DELETE (7/1 5) are unaffected 
by SHIFT-OUT. The effect of thi 3 character 
when using code extension techniques is 
described in International Standard ISO 2022 

0/15 

Shift in 

SI 

A control character which is used in conjunc- 
tion whith SHIFT-CUT and ESCAPE to extend the 
graphic character set of the code. It may 
reinstate the standard meanings of the bit 
combinations which follow it. The effect of 
this character when using code extension 
techniques is described in International 
Standard ISO 2022. 

1/0 

Transmission 
control charac- 
ter 7 (Data 
link escape) 

TC 7 (DLB) 

. 

A transmission control character which will 
change the meaning of a limited number of 
contiguously following characters. It is 
used exclusively to provide supplementary 
data transmission control functions . Only 
graphic characters and transmission control 
characters can be used in DLE sequences. 
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List of nemonics and meanings 


Position Name 


Abbreviatio 


Definition 


Device control 1 DC, 


Device control 2 DC. 


A device control character which is primarily 
intended for turning on or starting an ancil- 
lary device. If it as not required for this 
purpose, it may be used to restore a device 
to the basic mode of operation (see also 
DCp and DC 3 ), or for any other device control 
function not provided by other DCs. 

A device control character which is primarily 
intended for turning on or starting an 
anci ary device If it is not required for 
this purpose, it may be used to set a device 
to a special mode of operation (in which case 
DCj is used to restore the device to the 
basic mode), or for any other device control 
function not provided by other DCs. 


Device control 3 DC 3 


A device control character which is primarily 
intended for turning off or stopping an ancil 
lary device. This function may be a secondary 
level stop, for example, wait, pause, stand-by 
.or halt (in which case DC. is used to restore 
normal operation) . If it is not required for 
this purpose, it may be used for any other 
device control function not provided by other 
DCs. 


Device control 4 DC, 


Transmission con- 
trol character 8 TC (nak) 
(Negative ack- “ 

nowledge) 


A device control character which is primarily 
intended for turning off, stopping or inter- 
rupting an ancillary device. If it is not 
required for this purpose, it may be used for 
any other device control function not provi- 
ded by other DCs. 

A transmission control character transmitted 
by a receiver as a negative response to the 
sender. 


Transmission con-1 

trol character 9 ■ TCg (SYN) 

(Synchronous idlq 


A transmission control character used by a 
synchronous transmission system in the absence 
of any other character (idle condition) to 
provide a signal from which synchronism may 
be achieved or retained between data terminal 
equipment . 
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List of mnemonics and meanings 


Position 

Name 

Abbreviation 

Definition 

1/7 

Transmission 
control character 
10 (End. of trans- 
mission block) 

TC 10 (ETB) 

A transmission control character used 
to indicate the end of a transmission 
block of data where data is divided 
into such blocks for transmission 
purposes. 

1/8 

Cancel 

CAM 

A character, or the first character of 
a sequence, indicating that the data 
preceding it is in error. As a result, 
this data is to be ignored. The 
specific meaning of this character must 
be defined for each application and/or 
between sender and recipient. 

1/9 

End of medium 

EM 

A control character that may be used to 
identify the physical end of a medium, 
or the end of the used portion of a 
medium, or the end of the wanted por- 
tion of data recorded on a medium. The 
position of this character does not 
necessarily correspond to the physical 
end of the medium. 

1/10 

Subs titute 
character 

SUB 

A control character used in the place 
of a character that has been found to 
be invalid or in error. SUB is inten- 
ded to be introduced by automatic means. 

1/11 

Escape 

ESC 

A control character which is used to 
provide additional control functions. It 
alters the meaning of a limited number 
of contiguously following bit combina- 
tions. The use of this character is spe- 
cified in International Standard ISO 

2022. 

1/12 

Information 
separator 4 
(File separator) 

IS 4 (FS) 

A control character used to separate and 
qualify data logicially ; its specific 
meaning has to be defined for each appli- 
cation. If this character is used in 
hierarchical order, it delimits a data 
item called a FILE. 

1/13 

Information 
separator 3 
(Groupe separator) 

is 3 (cs) 

A control character used to separate and 
qualify data logicially;its specific 
meaning has to be defined for each appli- 
cation. If this character is used in 
hierarchical order, it delimits a data 
item called a GROUP. 
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List of nemonics and meanings 

Position 

Name 

Abbreviation 

Definition 

1/14 

Information 
separator 2 
(Record sepa- 
rator) 

is 2 (rs) 

A control character used 
to separate and qualify 
data logically ; its 
specific meaning has to 
be defined for each appli- 
cation. If this character 
is used in hierarchical 
order, it delimits a 
data item called a 

RECORD. 

1/15 

Information 
separator 1 
(Unit separa- 
tor) 

is, ( us) 

A control character used 
to separate and qualify 
data logically ; its 
specific meaning has to 
be defined for each appli- 
cation. If this character 
is used in hierarchical 
order, it delimits a 
data item called a 

UNIT. 


4.8 









TYPE 

GRAPHIC CHARACTER SET 

JEU DE CARACTERES 

GRAPHIQUES 

REGISTRATION NUMBER 

NUHERO D'ENREGISTREMENT 

002 

REGISTRATION DATE 

DATW 'H » RTJTVWnT.qTRiTT.’PTMT 

ESCAPE SEQUENCE 

G 0 set / jeu Gq 

ESC 2/8 

4/0 





Gj set / jeu G-| 

ESC 2/9 

4/0 

1975 / 12/01 

SEQUENCE 

D'ECHAPPEMENT 

MULTIBYTE SET 

JEU MULTIPLET 



nake/nom 


The graphic set of characters of the international reference version of ISO 646 

Jeu de caracteres graphiques de la version internationals de reference de la 
norme ISO 646 


[DESCRIPTION 


See attached table and legend. .This table and legend are identical to those given 
in ISO 646 . Graphic characters are those shown in column 2 to 7» except position 2/0 
and 7/15. 

Voir lea tableaux et ldgendes ci-jointa . Ces demiers sont identiques a ceux qui sont 
donnes dans la norme ISO 646 . Les caracteres graphiques sont ceux qui figurent dans 
les colonnes 2 k 7. sauf en ce qui conceme les positions 2/0 et 7/1 5 . 

IPONSOR/ORGANISKE DE PARRAINAGE 

Secretariat ISO/TC 97/SC 2 

Secretariat ISO/TC 97/SC 2 


3RIGIM (US3R)/0RIGIKE (UTILISATEUR) 

ISO 646 - 1973 
Norme ISO 646-1973 

TIELD OF UTILIZATION/BGKAIUE D' APPLICATION 

For use when there is no requirement to use a national or an application oriented 
version. 

A utiliser lorsqu'il n'est pas sp^cifi^ d'employer une version nationale ou une 
version destinee A une application particuliere. 


3.2 















The graphic characters of the international reference version 
of the ISO 646. 

Coded representation when invoked in columns 2-7 of code table. 
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POSITION WHEN 
INVOKED IN 
COLUMNS 2-7 



NAME OR MEANINC 

(ADDITIONAL NAMES IN PARENTHESIS ARE ALTERNATE MEANINGS 
DETERMINED BY SYNTAX OR USAGE AND ARE HOT SUBSTITUTION 
ALTERNATIVES). 


Exclamation mark 

Quotation mark, diaeresis (See note) 

Number sign 

Currency sign 

Per cent sign 

Ampersand 

Apostrophe, acute accent (See note) 

Left parenthesis 

Right parenthesis 

Asterisk 

Plus sign 

Comma, cedilla (See note) 

Hyphen ( minus sign) 

Full stop (period) 

Solidus 
Digit zero 
Digit one 
Digit two 
Digit three 
Digit four 
Digit f ive 
Digit six 
Digit seven 
Digit e ight 
Digit nine 













LEGENDS 

POSITION WHEN 

INVOKED IN GRAPHIC NAME OR MEANING 

COLUMNS 2-7 


3/10 

< 

Colon 

3/11 

i 

Semi-colon 

3/12 

< 

Less than sign 

3/13 

= 

Equals sign 

3/14 

> 

Greater than sign 

3/15 

7 

Question mark 

4/0 

6) 

Commercial at 

4/1 

A 

Capital letter A 

4/2 

B 

Capital letter B 

4/3 

C 

Capital letter C 

4/4 

D 

Capital letter D 

4/5 

E 

Capital letter E 

4/6 

F 

Capital le tter F 

4/7 

G 

Capital letter G 

4/8 

H 

Capital letter H 

4/9 

I 

Capital letter I 

4/10 

J 

Capital letter J 

4/11 

K 

Capital letter K 

4/12 

L 

Capital letter L 

4/13 

M 

Capital letter M 

4/14 

N 

Capital letter N 

4/15 

0 

Capital letter 0 

5/0 

P 

Capital letter P 

5/1 

Q 

Capital letter Q 

5/2 

K 

Capital letter R 
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LEGENDS 

POSITION 

WHEN INVOKED 
IN COLUMNS 

2-7 

GRAHUC. 

NAME OR MEANING 


Capital letter S 
Capital letter T 
Capital letter U 
Capital letter V 
Capital letter W 
Capital letter X 
Capital letter Y 
Capital letter Z 
Left square bracket 
Reverse solidus 
Right square bracket 

Upward arrow head, circumflex accent (See note) 

Underline 

Grave accent 

Small letter a 

Small letter b 

Small letter c 

Small letter d ■ 

Small letter e 
Small letter f 
Small letter g 
Small letter h 
Small letter i 
Small letter j 
Small letter k 
Small letter 1 
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LEGENDS 

POSITION WHEN 
INVOKED IN 

COLUMNS 2-7 

GRAPHIC 

NAME OR MEANING 

6/13 

D 

Small letter m 

6/14 

n 

Small letter n 

6/15 

o 

Small letter o 

7/0 

p 

Small letter p 

7/1 

q 

Small letter q 

7/2 

r 

Small letter r 

7/3 

s 

Small letter s 

7/4 

t 

Small letter t 

7/5 

XL 

Small letter u 

7/6 

V 

Small letter v 

.7/7 

V 

Small letter w 

7/8 

x 

Small letter x 

7/9 

y 

Small letter y 

7/10 

Z 

Small letter z 

7/11 

I 

Left curly bracket 

7/12 

1 

Vertical line 

7/13 

} 

Right curly bracket 

7/14 


Overline 
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ISO/I EC 8859-1:1997 (E) 


Foreword 


ISO (the International Organization for Standardization) and IEC (the 
International Electrotechnical Commission) form the specialized 
system for worldwide standardization. National bodies that are 
members of ISO or IEC participate in the development of 
International Standards through technical committees established by 
the respective organization to deal with particular fields of technical 
activity. ISO and IEC technical committees collabpfajfe in fields of 
mutual interest. Other international organizations/governmental and 
nongovernmental, in liaison with ISO and \EC/a\$o t^e part in the 
work. 


In the field of information technology, IS 
a joint technical committee, ISO/IEC JtTC 
Standards adopted by the joint techmcaTcom 
national bodies for voting. PubJkjatlorTas-an 
requires approval by at \easyv5% of the i\iati' 
vote. 


International Standard \|SO/ 
Technical Committe^NsO/IEC^JK 
Subcommittee SQ^, /Character s&fs 



d IEC have^est^blished 
Draft International 
are circulated to 
tem^tfonal Standard 
nal bodies casting a 


as prepared by Joint 
Information technology, 
d information coding. 



ISO/IEC 8859 / corrsjsts x qf the following parts, under the general title 
Information technology -\8-bit single-byte coded graphic character 
sets: 

- Part 1: Latin alphabet No. 1 
Part 2: Latin alphabet No. 2 

'n alphabet No. 3 
'atin alphabet No. 4 
Latin/Cyrillic alphabet 
rt 6: Latin/Arabic alphabet 
Part 7: Latin/Greek alphabet 

- Part 8: Latin/Hebrew alphabet 

- Part 9: Latin alphabet No. 5 

- Part 10: Latin alphabet No. 6 

Annexes A to C of this part of ISO/IEC 8859 are for information only. 


iii 
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Introduction 

ISO/IEC 8859 consists of several parts. Each part specifies a set of 
up to 191 graphic characters and the coded representation of these 
characters by means of a single 8-bit byte. Each set is intended for 
use for a particular group of languages. 
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Bnformation technology - 

8-bit single-byte coded graphic character sets - 

Parti: Latin alphabet No. 1 


1 Scope 

This part of ISO/IEC 8859 specifies a set of 191 
coded graphic characters identified as Latin 
alphabet No. 1 . 

This set of coded graphic characters is intended for 
use in data and text processing applications and 
also for information interchange. 

The set contains graphic characters used for 
general purpose applications in typical office 
environments in at least the following languages: 

Albanian, Basque, Breton, Catalan, Danish, Dutch, 
English, Faroese, Finnish, French (with restrictions, 
see Annex A.1, Notes), Frisian, Galician, German, 
Greenlandic, Icelandic, Irish Gaelic (new ortho- 
graphy), Italian, Latin, Luxemburgish, Norwegian, 
Portuguese, Rhaeto-Romanic, Scottish Gaelic, 
Spanish and Swedish. 

This set of coded graphic characters\maK I be 
regarded as a version of an 8-bit code accoMingMo 
ISO/IEC 2022 or ISO/IEC 4873 aTtewJT V 

This part of ISO/IEC 8859 may tet^be us&d^in 
conjunction with any other parts <3f f^O/IEG^8359. 
If coded characters fron/mbm thari one/part are to 
be used together, by m^anVof code extension 
techniques, the ejqu4va!ent\:oabd character sets 
from ISO/IEC 10^67 sbqu|3~M use*) instead within 
a version of ISp/lEC 48T3~aTteve1 / 2 or level 3. 

The codedNcrt^ractersHn this set may be used in 
conjunction witfKcoaed control functions selected 
fronylGO/IEC / B^29\Ho\yever, control functions are 
not^us^jjocr^ate cohrposite graphic symbols from 
two or morngraphic characters (see clause 6). 

NOTE ISb/lEC 8859 is not intended for use with 
Telematic services defined by ITU-T. If information coded 
according to ISO<(lfeC 8859 is to be transferred to such 
services, it will ^ave to conform to the requirements of 
those services at the access-point. 

2 Conformance 

2.1 Conformance of information interchange 

A coded-character-data-element (CC-data-element) 
within coded information for interchange is in 
conformance with this part of ISO/IEC 8859 if all the 


coded representations of graphic characters within 
that CC-data-element conforai tpstheTequirements 
of clause 6. \. N \ 

2.2 Conformance of devices \/ 

A device is in ^donformance—with this part of 
ISO/IEC 8859 iNt conforms te-tbe/equi remen ts of 
2.2.1 , and eittfej/or both on2.2.2 and 2.2.3. A claim 
of conformance shhij identify) the document which 
contain^the(dedcrihtiom^ppdified in 2.2.1. 

S e de^pnDtfon 

t conforms to this part of ISO/IEC 8859 
subject of a description that identifies 
I which the user may supply characters 
, or may recognize them when they are 
maae avanaDle to him, as specified respectively in 

2.2.2 and 2.2.3. 

^2.2 Originating devices 

An originating device shall allow its user to supply 
any sequence of characters from those specified in 
clause 6, and shall be capable of transmitting their 
coded representations within a CC-data-element. 

2.2.3 Receiving devices 

A receiving device shall be capable of receiving and 
interpreting any coded representations of characters 
that are within a CC-data-element, and that conform 
to clause 6, and shall make the corresponding 
characters available to its user in such a way that 
the user can identify them from among those 
specified there, and can distinguish them from each 
other. 

3 Normative references 

The following standards contain provisions which, 
through reference in this text, constitute provisions 
of this part of ISO/IEC 8859. At the time of publica- 
tion, the editions indicated were valid. All standards 
are subject to revision, and parties to agreements 
based on this part of ISO/IEC 8859 are encouraged 
to investigate the possibility of applying the most 
recent editions of the standards indicated below. 
Members of IEC and ISO maintain registers of 
currently valid International Standards. 


1 
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ISO/IEC 2022:1994, Information technology - 
Character code structure and extension techniques. 

ISO/IEC 4873:1991, Information technology - 
ISO 8-bit code for information interchange - 
Structure and rules for implementation. 

ISO/IEC 8824-1:1995, Information technology - 
Abstract Syntax Notation One (ASN.1): Specifica- 
tion of basic notation. 

4 Definitions 

For the purposes of this part of ISO/IEC 8859 the 
following definitions apply: 

4.1 bit combination: An ordered set of bits used 
for the representation of characters. 

4.2 byte: A bit string that is operated upon as a unit. 

4.3 character: A member of a set of elements 
used for the organization, control, or representation 
of data. 

4.4 code table: A table showing the characters 
allocated to each bit combination in a code. 

4.5 coded character set; code: A set of 

unambiguous rules that establishes a character set 
and the one-to-one relationship between the 
characters of the set and their bit combinations. 

4.6 coded-character-data-element <^C 
element): An element of interchanged in 
that is specified to consist of a sequence o 
representations of characters, m'accocdance 
one or more identified standards, for^cpd 
character sets. 

4.7 graphic character/ A\haradter, other than a 
control function, that haka visualX representation 
normally handwrittgfH-pilntedsOr oispWed, and that 
has a coded rep(resentatjonoonsi^ting of one or 
more bit combfnatibnsX 


NOTE - ln4St3/IE0885'9sa sir 
to represjefnt ^actKchafacter 



gle bit combination is used 


visual representation of a 
''a control function. 

That part of a code table identified 
nd row coordinates. 


5 Notation^bode table, and names 
5.1 Notation 

The bits of the bit combinations of the 8-bit code are 
identified by b 8 , b 7 , b 6 , b 5 , b 4 , b 3 , b 2 , and b v where 
b 8 is the highest-order, or most-significant bit and b 1 
is the lowest-order, or least-significant bit. 


The bit combinations may be interpreted to represent 
numbers in binary notation by attributing the 
following weights to the individual bits: 


Bit 

b 8 

b 7 

b 6 

b 5 

b 4 

b 3 

b 2 

b i 

Weight 

128 

64 

32 

16 

8 

4 

2 

1 


Using these weights, the bit combinations are 
identified by notations of the form/xx/yy, where xx 
and yy are numbers in the ra mje 00 to 15. The 
correspondence between the/hotatiohs of the form 
xx/yy and the bit combinatipi^s^eonsistinKof the bits 
b 8 to b 1 is as follows: 


xx is the number represented by b 8 , u 7 , 


where these bits 
1 respectively. 


re given tr 


j 6 and b 5 


jhts 8, 4, 2, and 




- yy is the.numbemepres^t^d by b 4 , b 3 , b 2 and b 1 
where thp^e/bits/ar^>given\tl)e weights 8, 4, 2, and 
1 respec^ive^ 

rcomdina^pd^ar^ also identified by notations 
form hkv where h and k are numbers in the 
to F in neltadecimal notation. The number 
le same as the number xx described above, 
and the ntknber k the same as the number yy 
describeckbbove. 


5.2 Layout of the code table 

n 8-bit code table consists of 256 positions 
Arranged in 16 columns and 16 rows. The columns 
and the rows are numbered 00 to 15. In hexa- 
decimal notation the columns and the rows are 
numbered 0 to F. 


The code table positions are identified by notations 
of the form xx/yy, where xx is the column number 
and yy is the row number. The column and row 
numbers are shown at the top and left edges of the 
table respectively. The code table positions are 
also identified by notations of the form hk, where h 
is the column number and k is the row number in 
hexadecimal notation. The column and row 
numbers are shown at the bottom and right edges of 
the table respectively. 

The positions of the code table are in one-to-one 
correspondence with the bit combinations of the 
code. The notation of a code table position, of the 
form xx/yy, or of the form hk, is the same as that of 
the corresponding bit combination. 

5.3 Names and meanings 

This part of ISO/IEC 8859 assigns a unique name 
and a unique identifier to each graphic character. 
These names and identifiers have been taken from 
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ISO/IEC 10646-1 (E). This part of ISO/IEC 8859 
also specifies an acronym for each of the characters 
SPACE, NO-BREAK SPACE and SOFT HYPHEN. 
For acronyms only Latin capital letters A to Z are 
used. It is intended that the acronyms be retained in 
all translations of the text. 

Except for SPACE (SP), NO-BREAK SPACE 
(NBSP) and SOFT HYPHEN (SHY), this part of 
ISO/IEC 8859 does not define and does not restrict 
the meanings of graphic characters. 

This part of ISO/IEC 8859 specifies a graphic 
symbol for each graphic character. This symbol is 
shown in the corresponding position of the code 
table. However, this part, or any other part, of 
ISO/IEC 8859 does not specify a particular style or 
font design for imaging graphic characters. Annex 
B of ISO/IEC 10367 gives further information on this 
subject. 

5.3.1 SPACE (SP) 

A graphic character the visual representation of 
which consists of the absence of a graphic symbol. 

5.3.2 NO-BREAK SPACE (NBSP) 

A graphic character the visual representation of 
which consists of the absence of a graphic symbol, 
for use when a line break is to be prevented in the 
text as presented. 

5.3.3 SOFT HYPHEN (SHY) 

A graphic character that is im 
symbol identical with, or similar t 
HYPHEN, for use when a line 
established within a won 


Table 1 - Character set, coded representation 


6 Specification 

This part of ISO/i 
allocated to 
(table 2). 
character 



acter set 

jecifie^191 characters 
^nations of the code table 
traders are combining 


/ Cojr(bipi ng characters are described in ISO/IEC 
i?:19'94/£ubefause 6T3f3. 


Controk functions, such as BACKSPACE or 
CARRIAtS£ RETURN, shall not be used to create 
composite grapj^te' symbols, which are made up 
from the graphic representations of two or more 
characters. 

6.1 Characters of the set and their coded 
representation 

See table 1 . 



Bit 




combi- 

Hex 

Identifier 

Name 

nation 




02/00 

20 

U+0020 

SPACE 

02/01 

21 

U+0021 

EXCLAMATION MARK 

02/02 

22 

U+0022 

QUOTATION MARK 

02/03 

23 

U+0023 

NUMBER SIGN 

02/04 

24 

U+0024 

DOLLAR SIGN 

02/05 

25 

U+0025 

PERCENT SIGN /\ 

02/06 

26 

U+0026 

AMPERSAND / / 

02/07 

27 

U+0027 

APOSTROPHE / < 

02/08 

28 

U+0028 

LEFT PARENTHESIS \ 

02/09 

29 

U+0029 

RIGHT PARENTIS \ \ 

02/10 

2A 

U+002A 

ASTERISK \ \ 

02/11 

2B 

U+002B 

PLUS SIGN \ \ > 

02/12 

2C 

U+002C 

COMMA \ \ V 

02/13 

2D 

U+002D 

HYFFIEN-MfNUS 

02/14 

2E 

U+002E 

R/LLSTOP — 7 

02/15 

2F 

14002^ 

\SOLIUUS \ r — — 

/blGIT ZERO\ \ 

03/00 

30 

U+p03O 

03/01 

31 

uioojyf 

DIGIT ONE \ \ 

03/02 

3 2 / 

Uj=0O32 

/DiqjTTWO \ / 

03/03 

03/04/ 

03/05 

/03/0^ 

03/07 

f 

36 

39 

OJ+0034 

0*0035 

^GIGITFOUR 
\D l£8T FJVE 

¥gjfsix 

DIGIT SEVEN 

TIG IT EIGHT 

DIGIT NINE 

U+0636 

U+003> 

U+0038 

\U+0039 


/03/Q8 

03/09\ 

03/10 

X 

36 s 

U40O3A 

COLON 

03/11 

JU+003B 

SEMICOLON 

03/12 

3C 

U+003C 

LESS-THAN SIGN 

03/13 

3D 

U+003D 

EQUALS SIGN 

03/14 

3E 

U+003E 

GREATER-THAN SIGN 

.03/15 

/b4/00 

3F 

U+003F 

QUESTION MARK 

40 

U+0040 

COMMERCIAL AT 

04/01 

41 

U+0041 

LATIN CAPITAL LETTER A 

04/02 

42 

U+0042 

LATIN CAPITAL LETTER B 

04/03 

43 

U+0043 

LATIN CAPITAL LETTER C 

04/04 

44 

U+0044 

LATIN CAPITAL LETTER D 

04/05 

45 

U+0045 

LATIN CAPITAL LETTER E 

04/06 

46 

U+0046 

LATIN CAPITAL LETTER F 

04/07 

47 

U+0047 

LATIN CAPITAL LETTER G 

04/08 

48 

U+0048 

LATIN CAPITAL LETTER H 

04/09 

49 

U+0049 

LATIN CAPITAL LETTER 1 

04/10 

4A 

U+004A 

LATIN CAPITAL LETTER J 

04/11 

4B 

U+004B 

LATIN CAPITAL LETTER K 

04/12 

4C 

U+004C 

LATIN CAPITAL LETTER L 

04/13 

4D 

U+004D 

LATIN CAPITAL LETTER M 

04/14 

4E 

U+004E 

LATIN CAPITAL LETTER N 

04/15 

4F 

U+004F 

LATIN CAPITAL LETTER 0 

05/00 

50 

U+0050 

LATIN CAPITAL LETTER P 

05/01 

51 

U+0051 

LATIN CAPITAL LETTER Q 

05/02 

52 

U+0052 

LATIN CAPITAL LETTER R 

05/03 

53 

U+0053 

LATIN CAPITAL LETTER S 

05/04 

54 

U+0054 

LATIN CAPITAL LETTER T 

05/05 

55 

U+0055 

LATIN CAPITAL LETTER U 

05/06 

56 

U+0056 

LATIN CAPITAL LETTER V 

05/07 

57 

U+0057 

LATIN CAPITAL LETTER W 

05/08 

58 

U+0058 

LATIN CAPITAL LETTER X 

05/09 

59 

U+0059 

LATIN CAPITAL LETTER Y 

05/10 

5A 

U+005A 

LATIN CAPITAL LETTER Z 

05/11 

5B 

U+005B 

LEFT SQUARE BRACKET 

05/12 

5C 

U+005C 

REVERSE SOLIDUS 

05/13 

5D 

U+005D 

RIGHT SQUARE BRACKET 

05/14 

5E 

U+005E 

CIRCUMFLEX ACCENT 

05/15 

5F 

U+005F 

LOW LINE 
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Table 1 (continued) 


Table 1 (concluded) 


Hex 

Identifier 

Name 

60 

U+0060 

GRAVE ACCENT 

61 

U+0061 

LATIN SMALL LETTER A 

62 

U+0062 

LATIN SMALL LETTER B 

63 

U+0063 

LATIN SMALL LETTER C 

64 

U+0064 

LATIN SMALL LETTER D 

65 

U+0065 

LATIN SMALL LETTER E 

66 

U+0066 

LATIN SMALL LETTER F 

67 

U+0067 

LATIN SMALL LETTER G 

68 

U+0068 

LATIN SMALL LETTER H 

69 

U+0069 

LATIN SMALL LETTER 1 

6A 

U+006A 

LATIN SMALL LETTER J 

6B 

U+006B 

LATIN SMALL LETTER K 

6C 

U+006C 

LATIN SMALL LETTER L 

6D 

U+006D 

LATIN SMALL LETTER M 

6E 

U+006E 

LATIN SMALL LETTER N 

6F 

U+006F 

LATIN SMALL LETTER 0 

70 

U+0070 

LATIN SMALL LETTER P 

71 

U+0071 

LATIN SMALL LETTER Q 

72 

U+0072 

LATIN SMALL LETTER R 

73 

U+0073 

LATIN SMALL LETTER S 

74 

U+0074 

LATIN SMALL LETTER T 

75 

U+0075 

LATIN SMALL LETTER U 

76 

U+0076 

LATIN SMALL LETTER V 

77 

U+0077 

LATIN SMALL LETTER W 

78 

U+0078 

LATIN SMALL LETTER X 

79 

U+0079 

LATIN SMALL LETTER Y 

7A 

U+007A 

LATIN SMALL LETTER Z 

7B 

U+007B 

LEFT CURLY BRACKET 

7C 

U+007C 

VERTICAL LINE 

7D 

U+007D 

RIGHT CURLY BRACKET 

7E 

U+007E 

TILDE 

AO 

U+00A0 

NO-BREAK SPACE 

A1 

U+00A1 

INVERTED EXCLAMATION MARK 

A2 

U+00A2 

CENT SIGN /\ 

A3 

U+00A3 

POUND SIGN / ^ 

A4 

U+00A4 

CURRENCY SIGN \ 

A5 

U+00A5 

YEN SIGN \ \ 

A6 

U+00A6 

BROKEN BAR. \ V 

A7 

U+00A7 

section4gn\ \ / 

A8 

U+00A8 

DIAERESIS\ X \ \ 

A9 

U+00A9 

COEYPIGHT StSN \ \ / 

AA 

U+00AA 

^FEMININTDReiNK INDICATOR 

AB 

U+00AB 

\EFT-F™T1NGJ30UBLE AM3LE C 

AC 

U+Q0&X 

NO\SIGN\ 

AD 

Mad 

X §OFTHYPHEN 

AE/ 

/ UY0OAE S 

REGISTERED StSN 

7 

/U+00AF 

> 

f 

< 

1 

BC V 

U+O0BO 

DEGREE'SIGN 


lktSob? 

PLU&MJNUS SIGN 


B2v U+0OB2 SUPERSCRIPT TWO 
X B3 «+00B3 SUPERSCRIPT THREE 
B4 U+80B4 ACUTE ACCENT 
B5 \U+0Cfe6 ^ROSIGN 
B6 UY00B6 kFILCROW SIGN 
B7 U+OOKp ' MIDDLE DOT 
B8 U+OOKS CEDILLA 
B9 U+00B9 SUPERSCRIPT ONE 
BA U+OOBA MASCULINE ORDINAL INDICATOR 
BB U+OOBB RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK 
BC U+OOBC VULGAR FRACTION ONE QUARTER 
BD U+OOBD VULGAR FRACTION ONE HALF 
BE U+OOBE VULGAR FRACTION THREE QUARTERS 
BF U+OOBF INVERTED QUESTION MARK 


Bit 

combi- 

nation 

Hex 

Identifier 

12/00 

CO 

U+OOCO 

12/01 

Cl 

U+00C1 

12/02 

C2 

U+00C2 

12/03 

C3 

U+00C3 

12/04 

C4 

U+00C4 

12/05 

C5 

U+00C5 

12/06 

C6 

U+00C6 

12/07 

C7 

U+00C7 

12/08 

C8 

U+00C8 

12/09 

C9 

U+00C9 

12/10 

CA 

U+OOCA 

12/11 

CB 

U+OOCB 

12/12 

CC 

U+OOCC 

12/13 

CD 

U+OOCD 

12/14 

CE 

U+OOCE 

12/15 

CF 

U+OOCF 

13/00 

DO 

U+OOD0' 

13/01 

D1 

U+O0D1 

13/02 

D2 

jAoojA 

13/03 

D3< 

MJ+O0D3 

13/04 

FM 

Hl+OOIW 

13/05/ 

/ vy 

U+80D5 

13/06 

/D6 

u+ood^ 

Am 

\)7 

U+00D7 

13 /Oa 

D^n 

U+00D8 

/1 3/09 

\D9 

X-00D9 

13/10 

x 

U+i)QDA 

13/11 

DB 

Hj+OODB 

13/12 

DC 

U+OODC 

13/13 

DD 

U+OODD 

13/14 

DE 

U+OODE 

/13/15 

DF 

U+OODF 

1/00 

EO 

U+00E0 

/I4/01 

El 

U+00E1 

14/02 

E2 

U+00E2 

14/03 

E3 

U+00E3 

14/04 

E4 

U+00E4 

14/05 

E5 

U+00E5 

14/06 

E6 

U+00E6 

14/07 

E7 

U+00E7 

14/08 

E8 

U+00E8 

14/09 

E9 

U+00E9 

14/10 

EA 

U+OOEA 

14/11 

EB 

U+OOEB 

14/12 

EC 

U+OOEC 

14/13 

ED 

U+OOED 

14/14 

EE 

U+OOEE 

14/15 

EF 

U+OOEF 

15/00 

FO 

U+00F0 

15/01 

FI 

U+00F1 

15/02 

F2 

U+00F2 

15/03 

F3 

U+00F3 

15/04 

F4 

U+00F4 

15/05 

F5 

U+00F5 

15/06 

F6 

U+00F6 

15/07 

F7 

U+00F7 

15/08 

F8 

U+00F8 

15/09 

F9 

U+00F9 

15/10 

FA 

U+OOFA 

15/11 

FB 

U+OOFB 

15/12 

FC 

U+OOFC 

15/13 

FD 

U+OOFD 

15/14 

FE 

U+OOFE 

15/15 

FF 

U+OOFF 


LATIN CAPITAL LETTER A WITH GRAVE 
LATIN CAPITAL LETTER A WITH ACUTE 
LATIN CAPITAL LETTER A WITH CIRCUMFLEX 
LATIN CAPITAL LETTER A WITH TILDE 
LATIN CAPITAL LETTER A WITH DIAERESIS 
LATIN CAPITAL LETTER A WR^NG ABOVE 
LATIN CAPITAL LETTER Ay / 

LATIN CAPITAL LETTER'D WITH CEDILLA 
LATIN CAPITAL LETTER EWITFK^RAVE 
LATIN CAPITAL LE^ERE WITH AbUTE 
LATIN CAPITAJ/feTTER E WimPIRCUMFLEX 
LATIN CAPITAL LETTER E WITH DIAERESIS 
LATIN CAPITAL LETTER I WITH GRAVE^ 
LATI^FSAPTIaL LETTER I WITH ACUTE 
LAllNCAPITAL LETTEFTIAMTFLpiRCUMFLEX 

■ AATTNCSPlTAb LETIEBJWIJn DIAERESIS 
I^TIN CAPITAlIlETTERETH (Icelandic) 
LETTER N WITH TILDE 

Letter o with grave 

llETTER 0 WITH ACUTE 
LETTER 0 WITH CIRCUMFLEX 
LETTER 0 WITH TILDE 
LETTER 0 WITH DIAERESIS 
N SIGN 

LETTER 0 WITH STROKE 
LATIN CAPITAL LETTER U WITH GRAVE 
LATIN CAPITAL LETTER U WITH ACUTE 
LATIN CAPITAL LETTER U WITH CIRCUMFLEX 
LATIN CAPITAL LETTER U WITH DIAERESIS 
LATIN CAPITAL LETTER Y WITH ACUTE 
LATIN CAPITAL LETTER THORN (Icelandic) 
LATIN SMALL LETTER SHARP S (German) 
LATIN SMALL LETTER A WITH GRAVE 
LATIN SMALL LETTER A WITH ACUTE 
LATIN SMALL LETTER A WITH CIRCUMFLEX 
LATIN SMALL LETTER A WITH TILDE 
LATIN SMALL LETTER A WITH DIAERESIS 
LATIN SMALL LETTER A WITH RING ABOVE 
LATIN SMALL LETTER AE 
LATIN SMALL LETTER C WITH CEDILLA 
LATIN SMALL LETTER E WITH GRAVE 
LATIN SMALL LETTER E WITH ACUTE 
LATIN SMALL LETTER E WITH CIRCUMFLEX 
LATIN SMALL LETTER E WITH DIAERESIS 
LATIN SMALL LETTER I WITH GRAVE 
LATIN SMALL LETTER I WITH ACUTE 
LATIN SMALL LETTER I WITH CIRCUMFLEX 
LATIN SMALL LETTER I WITH DIAERESIS 
LATIN SMALL LETTER ETH (Icelandic) 

LATIN SMALL LETTER N WITH TILDE 
LATIN SMALL LETTER 0 WITH GRAVE 
LATIN SMALL LETTER 0 WITH ACUTE 
LATIN SMALL LETTER 0 WITH CIRCUMFLEX 
LATIN SMALL LETTER 0 WITH TILDE 
LATIN SMALL LETTER 0 WITH DIAERESIS 
DIVISION SIGN 

LATIN SMALL LETTER 0 WITH STROKE 
LATIN SMALL LETTER U WITH GRAVE 
LATIN SMALL LETTER U WITH ACUTE 
LATIN SMALL LETTER U WITH CIRCUMFLEX 
LATIN SMALL LETTER U WITH DIAERESIS 
LATIN SMALL LETTER Y WITH ACUTE 
LATIN SMALL LETTER THORN (Icelandic) 

LATIN SMALL LETTER Y WITH DIAERESIS 
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6.2 Code table 

For each character in the set the code table 
(table 2) shows a graphic symbol at the position in 
the code table corresponding to the bit combination 
specified in table 1 . 


The shaded positions in the code table correspond 
to bit combinations that do not represent graphic 
characters. Their use is outside the scope of 
ISO/IEC 8859; it is specified in other International 
Standards, for example ISO/IEC 6429. 


Table 2 - Code table of Latin alphabet No. 1 


[3 OES [31 



o 

o 

00 

-Q 

0 

b? 

- 0 0 

0 

b< 

; 0 0 

1 

bs 

1 0 1 

0 

00 01 02 

00 


SP 

01 

J 

02 

II 

03 


# 

04 


$ 

05 


% 

06 


& 

07 

■■ 

B 

08 


x < 

09 

mm 

£ 


1 1 1 1 _ 

0 0 11 


1 J J 1 1 _ 

0 0 0 0 J 

o o i i 7 


1 A a a 


2 B R b 


— 1^^- 
i\ i 
o XT \i 


4 15 







10 10 


i i i 


1111 


® \ I 


ABC 


D E F X 
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7 Identification of the character set 

7.1 Identification according to ISO/IEC 2022 
and ISO/IEC 4873 

The graphic characters of this part of ISO/IEC 8859 
constitute a single coded character set. However in 
accordance with ISO/IEC 2022 and ISO/IEC 4873 
the code table of this part of ISO/IEC 8859 may be 
considered to consist of the following components: 

The character SPACE represented by bit 
combination 02/00; 

a 94-character GO graphic character set 
represented by bit combinations 02/01 to 07/14; 

- a 96-character G1 graphic character set 
represented by bit combinations 10/00 to 15/15. 

When the identification methods of ISO/IEC 2022 or 
ISO/IEC 4873 are used this part of ISO/IEC 8859 
shall be identified by the following pair of 
designation functions: 

GZD4 04/02 (ESC 02/08 04/02) 

G1D6 04/01 (ESC 02/13 04/01) 

NOTE - The corresponding escape sequences are 
shown in parentheses. 


7.2 Identification according to ISO/IEf 
(ASN.1) 


L8824-1 


In the terminology of ISO/IEC 8824-1 the 
set of this part of ISO/I EC/E&S9 and 
corresponding coded representations ar&^distin' 
and are known as the "character abstract syntax 
and the "character transfer synta 






When the identification methods of ISO/IEC 8824-1 
are used this part of ISO/IEC 8859 shall be 
identified by the following object identifiers: 

- character set 

{ iso standard 8859 1 abstract-syntax (1) } 

- coded representations 

{ iso standard 8859 1 transfer-syntax (0) } 

The corresponding object descrjpto 

- character set "ISO 885^p 

- coded representatio 

7.3 Identification/usitrg the ISO International 
register of codecLcharacter~set$ to be used 
with escape/^fequenc^s 

According to 7/t ajjove the\chkracter set of this part 
of ISO/IRC be considered to consist of 

the cha^cterySBACE/aNsW-character GO graphic 

6-character G1 graphic 
and G1 graphic character 
cl by the use of the Registration 
ISO International register of 
racter sets to be used with escape 


When these registration numbers are used this part 
of ISO/IEC 8859 shall be identified by the following 
air of registration numbers: 

> 

- GO graphic character set ISO-IR 6 

- G1 graphic character set ISO-IR 100 
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Annex A 

(informative) 

Coverage of languages by parts 1 to 10 of ISO/IEC 8859 


A.1 Languages of European origin written in Latin script 


The following parts of ISO/IEC 8859 specify coded 
character sets which comprise various different 
selections of characters based on the Latin 
alphabet. These sets are identified by the numbers 
1 to 6 as shown: 


The following official and regional l^ngpages written 
in Europe are covered by the Latin , alphabets 1-6 as 
indicated by number in table AO : \ 


ISO/IEC 

ISO/IEC 

ISO/IEC 

ISO/IEC 

ISO/IEC 

ISO/IEC 


Language 


8859-1 

8859-2 

8859-3 

8859-4 

8859-9 

8859-10 


Latin alphabet No. 1 
Latin alphabet No. 2 
Latin alphabet No. 3 
Latin alphabet No. 4 
Latin alphabet No. 5 
Latin alphabet No. 6 


Dutch 

English 

Esperanto 

Estonian 

Faroese 

Finnish 

French 


Table A.1 - Language coverage 




Covered by alphabet(s) Language 



1 4 5 6 

1 5 / 

12345/6 
3 \ 


4 \5 6 



1 \ 5\ Norwegian 

1 vX Polish 

1 2 3 4 5 6 Portuguese 

1 4 5 6 Rhaeto-Rom 

2 Romanian 

1 /\ 6 Sami 

/ / 5 6 Scottish Gae 

/ Slovak 

•n 3 5 Slovene 

1 2 3 4 5 6 Sorbian 

4 Spanish 

4 6 Swedish 

1 5 Turkish 


Covered by alphabet(s) 

1 

4 5 6 

2 

1 3 

5 

1 

5 

1 2 

1 

4 6 

5 

2 

2 

4 6 

2 

1 

5 

1 

4 5 6 

(3) 

5 


NOTES \ \ \ \ 

1 The li£fof> languages iKtame A.1 is not exhaustive. 
It shows the languaaesMhatwe included in the Scope 
clause of/eacj/part oTJSO/fEC 8859. 

2 xor wMng/French three characters (CE, oe, Y) not 
specified in parts 1, 3 and 9, are also needed. 

3 The various Sami languages use partly differing 
orthographies. .The,ef1aracter sets in parts 4 and 10 cover 
the requirements^ the Sami languages most commonly 
used in Finland, Norway and Sweden. For the Skolt Sami 
language used in Finland and Norway additional 
characters are needed. These are included in ISO-IR 158 
and 197. 


4 There are several official written languages outside 
Europe that are covered by Latin alphabet No. 1. 
Examples are Indonesian/Malay, Tagalog (Philippines), 
Swahili, Afrikaans. 

5 Use of Latin alphabet No. 3 for Turkish is deprecated. 
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A. 2 Languages written in non-Latin scripts 

The following parts of ISO/IEC 8859 specify coded 
character sets which include graphic characters 
from alphabets other than the Latin alphabet: 

ISO/IEC 8859-5 Latin/Cyrillic alphabet 

ISO/IEC 8859-6 Latin/Arabic alphabet 

ISO/IEC 8859-7 Latin/Greek alphabet 

ISO/IEC 8859-8 Latin/Hebrew alphabet 


The following official and regional languages are 
covered by these alphabets: 

The Cyrillic characters included in part 5 cover 
Bulgarian, Byelorussian, (Slavic) Macedonian, 
Russian, Serbian and Ukrainian (as written up to 
1990, see also Scope of part 5). 

The Arabic characters included/frypart 6 cover 
Arabic. The Greek characters^incKJded in part 7 
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ISO/I EC 8859-1:1997 (E) 


Annex B 

(informative) 

Main differences between the First edition and this Second edition of 

this part of ISO/IEC 8859 


B.1 The names of the graphic characters have 
been amended where necessary to align them with 
the names of characters adopted for all standards 
on coded character sets developed under the 
responsibility of ISO/IEC JTC 1 . For each character 
the short identifiers specified in ISO/IEC 10646-1 
Amendment 9 have been added to table 1. 

B.2 The new style of conformance clause, adopted 
for all standards on coded character sets, has been 
introduced. 

B.3 Object identifiers conforming to Abstract Syntax 
Notation One (ASN.1, see ISO/IEC 8824-1) are 
specified in 7.2 for the character set, and the 
corresponding coded representations, of this part of 
ISO/IEC 8859. 

Registration numbers from the International register 
of coded character sets to be used with escape 
sequences, have been included as an additional 


B.4 The previous Annex A (Geographical areas of 
application of the coded character set^f this part of 
ISO 8859) has been replaced by q/new Annex A 
that identifies the coverage pflanguages by parts 
1-10 of ISO/IEC 8859. \ 

The previous Annex B (Relationship with ISO 
6937/2) has been deleted. \ ^ 


B.5 Various edRoriaTadjtrstrrj 
have been madp^to the text 
hexadecimal equivalents of 
have bebn badea ter tables X 


ectsjyra clarifications 
of the standard. The 
)he bit combinations 
and 2, and a revised 


<rsed/fbi/the graphic symbols in 


Annex C, Bibliography, has been added. 
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Annex C 

(informative) 

Bibliography 

ISO/IEC 6429:1992, Information technology - Control functions for coded character sets. 

ISO/I EC 10367:1991, Information technology - Standardized coded graphic character sets for use in 
8-bit codes. / / 

ISO/IEC 10646-1 :1 993, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - 
Part 1: Architecture and Basic Multilingual Plane. X \ 

ISO International register of coded character sets to be used with escape sequerfcek. ' \ 
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TYPE: 

96-character graphic character Set 

Jeu de 96 caracteres graphiques 

REGISTRATION NUMBER: 203 

NUMERO D ENREGISTREMENT: 


DATE OF REGISTRATION: 

DATE D ENREGISTREMENT: 

1998-09-16 

ESCAPE SEQUENCE 

GO: 

- 


SEQUENCE D ECHAPPEMENT 

G1: 

ESC 02/13 

06/02 


G2: 

ESC 02/14 

06/02 


G3: 

ESC 02/15 

06/02 


CO: 

- 



Cl: 

- 



NAME/NOM 


European supplementary Latin set ("Latin 9") 

Jeu supplementaire latin pour 1' Europe (« latin 9 ») 

DESCRIPTION 

A set for use with the IRV of ISO/IEC 646 (ISO-IR 6) in an 8-bit code or in a 7-bit 
environment with code extension techniques prescribed by ISO/IEC 2022) . The set is 
derived from the right-hand part of Latin Alphabet No. i (ISO-IR 100) by 
replacement of characters in columns 10 and 11 with letters needed for the French 
and Finnish languages, plus the EURO SIGN. 

Un jeu utilisable avec 1'IRV de 1' ISO/IEC 646 (ISO-IR 6) dans un code & 8 bits ou 
dans un environnement a 7 bits, a l'aide des techniques d' extension de code 
prescrites par 1'ISO/CEI 2022. Le jeu est derive de la partie droite de l'alphabet 
latin n° 1 (ISO-IR 10 0) ou l'on a remplace certains caracteres des colonnes 10 et 
11 d'une part par des lettres requises en frangais et an finnois, et d' autre part 
par le SYMBOLE EURO. 


SPONSOR/ORGANISME DE PARRAINAGE 

Canada (Standards Council of Canada/Conseil canadien des normes) , Denmark (Dansk 
Standard) , Finland (SFS - STY/Tieke) , France (AFNOR - Association frangaise de 
normalisation) , Ireland (NSAI - National Standards Authority of Ireland) 

ORIGIN/ORIGINE 

ISO/IEC JTC1/SC2/WG3 

FIELD OF UTILIZATION/DOMAINE D APPLICATION 

Communication and processing of text in European languages. The set 
provides for the languages enumerated in ISO/IEC 8859-1. In 
addition, it contains the EURO SIGN and provides support for the 
French, and Finnish languages in addition. 


Communication et traitement de texte dans les langues europeennes. 
Ce jeu s' applique a toutes les langues couvertes par l'alphabet 
latin n° 1. 11 ajoute par ailleurs le SYMBOLE EURO, et offre de plus 
un soutien integral du frangais et du finnois. 
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Pos. 


Name 


Note 


10/00 NO -BREAK SPACE 

10/01 INVERTED EXCLAMATION MARK 

10/02 CENT SIGN 

10/03 POUND SIGN 

10/04 EURO SIGN 

10/05 YEN SIGN 

10/06 LATIN CAPITAL LETTER S WITH CARON 
10/07 SECTION SIGN 

10/08 LATIN SMALL LETTER S WITH CARON 

10/09 COPYRIGHT SIGN 

10/10 FEMININE ORDINAL INDICATOR 

10/11 LEFT- POINTING DOUBLE ANGLE QUOTATION MARK 

10/12 NOT SIGN 

10/13 SOFT HYPHEN 

10/14 REGISTERED SIGN 

10/15 MACRON 

11/00 DEGREE SIGN 

11/01 PLUS -MINUS SIGN 

11/02 SUPERSCRIPT TWO 

11/03 SUPERSCRIPT THREE 

11/04 LATIN CAPITAL LETTER Z WITH CARON 
11/05 MICRO SIGN 
11/06 PILCROW SIGN 
11/07 MIDDLE DOT 

11/08 LATIN SMALL LETTER Z WITH CARON 

11/09 SUPERSCRIPT ONE 

11/10 MASCULINE ORDINAL INDICATOR 

11/11 RIGHT- POINTING DOUBLE ANGLE QUOTATION MARK 

11/12 LATIN CAPITAL LIGATURE OE 

11/13 LATIN SMALL LIGATURE OE 

11/14 LATIN CAPITAL LETTER Y WITH DIAERESIS 

11/15 INVERTED QUESTION MARK 


U+00A0 

U+00A1 

U+00A2 

U+00A3 

U+20AC 

U+00A5 

U+0160 

U+00A7 

U+0161 

U+00A9 

U+00AA 

U+00AB 

U+00AC 

U+00AD 

U+00AE 

U+00AF 

U+00B0 

U+00B1 

U+00B2 

U+00B3 

U+017D 

U+00B5 

U+00B6 

U+00B7 

U+017E 

U+00B9 

U+00BA 

U+00BB 

U+0152 

U+0153 

U+0178 

U+00BF 





































































































Pos. 

Name 

12/00 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

GRAVE 

12/01 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

ACUTE 

12/02 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

CIRCUMFLEX 

12/03 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

TILDE 

12/04 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

DIAERESIS 

12/05 

LATIN 

CAPITAL 

LETTER 

A 

WITH 

RING ABOVE 

12/06 

LATIN 

CAPITAL 

LETTER 

AE 


12/07 

LATIN 

CAPITAL 

LETTER 

C 

WITH 

CEDILLA 

12/08 

LATIN 

CAPITAL 

LETTER 

E 

WITH 

GRAVE 

12/09 

LATIN 

CAPITAL 

LETTER 

E 

WITH 

ACUTE 

12/10 

LATIN 

CAPITAL 

LETTER 

E 

WITH 

CIRCUMFLEX 

12/11 

LATIN 

CAPITAL 

LETTER 

E 

WITH 

DIAERESIS 

12/12 

LATIN 

CAPITAL 

LETTER 

I 

WITH 

GRAVE 

12/13 

LATIN 

CAPITAL 

LETTER 

I 

WITH 

ACUTE 

12/14 

LATIN 

CAPITAL 

LETTER 

I 

WITH 

CIRCUMFLEX 

12/15 

LATIN 

CAPITAL 

LETTER 

I 

WITH 

DIAERESIS 

13/00 

LATIN 

CAPITAL 

LETTER 

ETH 


13/01 

LATIN 

CAPITAL 

LETTER 

N 

WITH 

TILDE 

13/02 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

GRAVE 

13/03 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

ACUTE 

13/04 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

CIRCUMFLEX 

13/05 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

TILDE 

13/06 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

DIAERESIS 

13/07 

MULTIPLICATION SIGN 

13/08 

LATIN 

CAPITAL 

LETTER 

0 

WITH 

STROKE 

13/09 

LATIN 

CAPITAL 

LETTER 

U 

WITH 

GRAVE 

13/10 

LATIN 

CAPITAL 

LETTER 

U 

WITH 

ACUTE 

13/11 

LATIN 

CAPITAL 

LETTER 

U 

WITH 

CIRCUMFLEX 

13/12 

LATIN 

CAPITAL 

LETTER 

U 

WITH 

DIAERESIS 

13/13 

LATIN 

CAPITAL 

LETTER 

Y 

WITH 

ACUTE 

13/14 

LATIN 

CAPITAL 

LETTER 

THORN 


13/15 

LATIN 

SMALT. LETTER SHARP S 



Note 

u+ooco 

U+00C1 

U+00C2 

U+00C3 

U+OOC4 

U+00C5 

U+00C6 

U+00C7 

U+00C8 

U+00C9 

U+OOCA 

U+OOCB 

U+OOCC 

U+OOCD 

U+OOCE 

U+OOCF 

U+OODO 

U+OOD1 

U+00D2 

U+00D3 

U+OOD4 

U+00D5 

U+00D6 

U+00D7 

U+00D8 

U+00D9 

U+OODA 

U+OODB 

U+OODC 

U+OODD 

U+OODE 

U+OODF 




































































































Pos. 

Name 

Note 

14/00 

LATIN SMALL LETTER A WITH GRAVE 

U+00E0 

14/01 

LATIN SMALL LETTER A WITH ACUTE 

U+00E1 

14/02 

LATIN SMALL LETTER A WITH CIRCUMFLEX 

U+00E2 

14/03 

LATIN SMALL LETTER A WITH TILDE 

U+00E3 

14/04 

LATIN SMALL LETTER A WITH DIAERESIS 

U+00E4 

14/05 

LATIN SMALL LETTER A WITH RING ABOVE 

U+00E5 

14/06 

LATIN SMALL LETTER AE 

U+00E6 

14/07 

LATIN SMALL LETTER C WITH CEDILLA 

U+00E7 

14/08 

LATIN SMALL LETTER E WITH GRAVE 

U+00E8 

14/09 

LATIN SMALL LETTER E WITH ACUTE 

U+00E9 

14/10 

LATIN SMALL LETTER E WITH CIRCUMFLEX 

U+00EA 

14/11 

LATIN SMALL LETTER E WITH DIAERESIS 

U+00EB 

14/12 

LATIN SMALL LETTER I WITH GRAVE 

U+00EC 

14/13 

LATIN SMALL LETTER I WITH ACUTE 

U+00ED 

14/14 

LATIN SMALL LETTER I WITH CIRCUMFLEX 

U+00EE 

14/15 

LATIN SMALL LETTER I WITH DIAERESIS 

U+OOEF 

15/00 

LATIN SMALL LETTER ETH 

U+00F0 

15/01 

LATIN SMALL LETTER N WITH TILDE 

U+00F1 

15/02 

LATIN SMALL LETTER 0 WITH GRAVE 

U+00F2 

15/03 

LATIN SMALL LETTER 0 WITH ACUTE 

U+00F3 

15/04 

LATIN SMALL LETTER 0 WITH CIRCUMFLEX 

U+00F4 

15/05 

LATIN SMALL LETTER 0 WITH TILDE 

U+00F5 

15/06 

LATIN SMALL LETTER 0 WITH DIAERESIS 

U+00F6 

15/07 

DIVISION SIGN 

U+00F7 

15/08 

LATIN SMALL LETTER O WITH STROKE 

U+00F8 


15/09 

LATIN SMALL LETTER U WITH GRAVE 

U+00F9 

15/10 

LATIN SMALL LETTER U WITH ACUTE 

U+OOFA 

15/11 

LATIN SMALL LETTER U WITH CIRCUMFLEX 

U+OOFB 

15/12 

LATIN SMALL LETTER U WITH DIAERESIS 

U+OOFC 

15/13 

LATIN SMALL LETTER Y WITH ACUTE 

U+OOFD 

15/14 

LATIN SMALL LETTER THORN 

U+OOFE 


15/15 


LATIN SMALL LETTER Y WITH DIAERESIS 


U+OOFF 
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INTERNATIONAL STANDARD © ISO/IEC 


ISO/I EC CD 6937 


Information technology - Coded graphic character set 
for text communication - Latin alphabet 


1 Scope 

This International Standard 

a) specifies the coded representation of the characters; 

b) specifies a repertoire of the Latin alphabetic and non-alphabetic characters for the communication of text in many 
European languages using the Latin script; 

c) specifies rules for the definitions and use of graphic character subrepertoires, i.e. subsets of the specified 
character repertoire. 


2 Conformance and implementation 

2.1 Conformance 

2.1.1 Conformance of information interchange 

A coded-character-data-element (CC-data-element) within coded information for interchange is in conformance with 
this International Standard if all coded representations of characters within that CC-data-element conform to the 
mandatory requirements of this International Standard. 

A claim of conformance shall identify: 

- the subrepertoire in accordance with clause 9, if one has been adopted, 

- the 7-bit coding in accordance with Annex A, if it has been adopted. 

2.1 .2 Conformance of devices 

A device is in conformance with this International Standard if it conforms to the requirements of 2.1 .2.1 and either 
or both 2.1 .2.2 and 2.1 .2.3 below. 

2.1 .2.1 Device description 

A device that conforms to this International Standard shall be the subject of a description that identifies the means 
by which the user may supply characters to the device, or may recognize them when they are made available to 
the user, as specified respectively in 2.1 .2.2 and 2. 1.2. 3 below. 

2.1. 2.2 Originating devices 

An originating device shall allow its user to supply any sequence of characters of the character repertoire, and shall 
be capable of transmitting their coded representations within a CC-data-element. 
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2. 1.2. 3 Receiving devices 

A receiving device shall be capable of receiving and interpreting any coded representation of characters that are 
within a CC-data-element, and that conform to 2.1.1 of this International Standard, and shall make the 
corresponding characters available to its user in such a way that the user can identify them among those of the 
repertoire, and can distinguish them from each other. 

2.2 Implementation 

The use of this character set requires definitions of its implementation in various media. For example, these could 
include magnetic and optical interchangeable media and transmission channels, thus permitting interchange of data 
to take place either indirectly by means of an intermediate recording on a physical medium, or by local connection 
of various units (such as input and output devices and computers) or by means of data transmission equipment. 

The implementation of this coded character set in physical media and for transmission, taking into account the need 
for error checking, may be the subject of other International Standards. 


3 Normative references 

The following standards contain provisions which, through reference in this text, constitute provisions of this 
International Standard. At the time of publication, the editions indicated were valid. All Standards are subject to 
revision, and parties to agreements based on this International Standard are encouraged to investigate the 
possibility of applying the most recent editions of the standards indicated below. Members of I EC and ISO maintain 
registers of currently valid International Standards. 

ISO/IEC 2022:1994, Information technology - Character code structure and extension techniques. 

ISO/IEC 7350:1991, Information technology - Registration of repertoires of the graphic characters from 
ISO/IEC 10367. 

ISO/IEC 10367:1991, Information technology - Standardized coded graphic character sets for use in 8-bit 
codes. 

ISO/IEC 10538:1991, Information technology - Control functions for text communication. 

ISO/IEC 10646:1998, Information technology - Universal Multiple-Octet Coded Character Set (UCS) - Part 1: 
Architecture and Basic Multilingual Plane (BMP) (including AMD 1-9 and COR 1). 

4 Definitions 

For the purposes of this International Standard, the following definitions apply: 

4.1 active position: The character position which is to image the graphic symbol representing the next 
graphic character or relative to which the next control function is to be executed. 

4.2 bit combination: An ordered set of bits used for the representation of characters. 

4.3 character: A member of a set of elements used for the organization, control or representation of data. 

4.4 character position: The portion of a display that is imaging or is capable of imaging a graphic symbol. 

4.5 coded-character-data-element (CC-data-element): An element of interchanged information that is 
specified to consist of a sequence of coded representations of characters, in accordance with one or more identified 
standards for coded character sets. 


NOTE 1 In a communication environment in accordance with the Reference Model for Open Systems Interconnec- 
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tion of ISO 7498, a CC-data-element will form all or part of the information that corresponds to the Present- 
ation-Protocol-Data-Unit (PPDU) defined in that International Standard. 

NOTE 2 When information interchange is accomplished by means of interchangeable media, a CC-data-element 
will form all or part of the information that corresponds to the user data, and not that recorded during formatting 
and initialization. 

4.6 coded character set; code: A set of unambiguous rules that establishes a character set and the one-to-one 
relationship between the characters of the set and their bit combinations. 

4.7 code extension: The techniques for the encoding of characters that are not included in the character set 
of a given code. 

4.8 code table: A table showing the character allocated to each bit combination in a code. 

4.9 control character: A control function the coded representation of which consists of a single bit combination. 

4.1 0 control function: An element of a character set that affects the recording, processing, transmission or inter- 
pretation of data, and that has a coded representation consisting of one or more bit combinations. 

4.11 device: A component of information processing equipment which can transmit, and/or receive, coded 
information within CC-data-elements. 

NOTE 3 It may be an input/output device in the conventional sense, or a process such as an application program 
or gateway function. 

4.12 escape sequence: A string of bit combinations that are used for control purposes in code extension 
procedures. The first of these bit combinations represents the control function ESCAPE. 

NOTE 4 Formats and rules regarding the use of escape sequences are specified in ISO/IEC 2022. 

4.13 graphic character: A character, other than a control function, that has a visual representation normally 
handwritten, printed or displayed, and that has a coded representation consisting of one or more bit combinations. 

4.14 graphic symbol: A visual representation of a graphic character or of a control function. 

4.15 repertoire: A specified set of characters that are represented by one or more bit combinations of a coded 
character set. 

4.16 text: A representation of information for human comprehension that is intended for presentation in a 
two-dimensional form, for example printed on paper or displayed on a screen. 

Text consists of symbols, phrases or sentences in natural or artificial languages, pictures, diagrams and tables. 
NOTE 5 This International Standard applies only to text made up of characters. 

4.17 text communication; communication of text: The transfer of text by means of telecommunications. 

NOTE 6 In the context of this International Standard, text communication is by means of binary-coded represen- 
tations of characters. 

4.18 user: A person or other entity that invokes the services provided by a device. 

NOTE 7 This entity may be a process such as an application program if the "device" is a code convertor or a 
gateway function, for example. 

NOTE 8 The characters, as supplied by the user or made available to the user, may be in the form of codes local 
to the device, or of non-conventional visible representations, provided that 2.1.2 above is satisfied. 
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5 Notation, code table and names 

5.1 Notation 

The bits of the bit combinations of the 8-bit code are identified by bg, b 7 , bg, b 5 , b 4 , b 3 , b 2 and b^, where bg is 
the highest-order, or most significant bit and b 1 is the lowest-order, or least significant bit. 

The bit combinations may be interpreted to represent numbers in the range 0 to 255 in binary notation by attributing 
the following weights to the individual bits: 


Bit 

00 

-Q 

b 7 

m 

m 

m 

m 

b 2 

b 1 

Weight 

128 

64 

32 

16 

8 

4 

2 

1 


In this International Standard, the bit combinations are identified by notations of the form xx/yy, where xx and yy 
are numbers in the range 00 to 1 5. The correspondence between the notations of the form xx/yy and the bit 
combinations consisting of the bits bg to b-| , is as follows: 

- xx is the number represented by bg, b 7 , bg and b 5 where these bits are given the weights 8, 4, 2 and 1, 
respectively: 

- yy is the number represented by b 4 , bg, b 2 and b 1 where these bits are given the weights 8, 4, 2 and 1, 
respectively. 

The notations of the form xx/yy are the same as the ones used to identify code table positions, where xx is the 
column number and yy is the row number (see 5.2). 

5.2 Code table 

An 8-bit code table consists of 256 positions arranged in 16 columns and 16 rows. The columns and rows are 
numbered 00 to 15. 

The code table positions are identified by notations of the form xx/yy, where xx is the column number and yy is the 
row number. 

The positions of the code table are in one-to-one correspondence with the bit combinations of the code. The 
notation of a code table position, of the form xx/yy, is the same as that of the corresponding bit combination. 

5.3 Names 

This International Standard assigns one name to each character. In addition, it specifies an acronym for the three 
characters SPACE, NO-BREAK SPACE and SOFT HYPHEN and a graphic symbol for the other graphic characters. 
By convention, only capital letters, space and hyphen are used for writing the names of characters. It is intended 
that the acronym and this convention be retained in all translations of the text of this International Standard. 

The names chosen to denote graphic characters are intended to reflect their customary meaning. However, this 
International Standard does not define and does not restrict the meanings of graphic characters. Neither does it 
specify a particular style or font design for imaging the graphic characters. 
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6 Specifications of SPACE, NO-BREAK SPACE and SOFT HYPHEN 


6.1 SPACE (SP): A graphic character that has a visual representation consisting of the absence of a graphic 
symbol. Its coded representation is 02/00. 

6.2 NO-BREAK SPACE (NBSP): A graphic character, the visual representation of which consists of the ab- 
sence of a graphic symbol, for use when a line break is to be prevented in the text as presented. 

6.3 SOFT HYPHEN (SHY): A graphic character that is imaged by a graphic symbol identical with, or similar to, 
that representing HYPHEN-MINUS, for use when a line break has been established within a word. 

7 Composition of the character repertoire 

The repertoire of the graphic characters defined in this International Standard consists of 

a) SPACE (SP) 

and of 332 characters as follows 

b) Latin alphabetic characters comprising 

1) the 52 capital and small letters of the basic Latin alphabet, 

2) accented letters, the graphic representations of which consist of combinations of basic Latin letters 
with diacritical marks, 

3) special alphabetic characters which are neither basic Latin letters nor combinations of basic Latin 
letters with diacritical marks; 

c) non-alphabetic characters, such as digits, fractions, punctuation and diacritical marks, monetary symbols etc. 

The repertoire, excluding SPACE, is specified in table 4. In each table entry, the first column specifies the name 
of the character. The second column specifies its coded representation (see 8.3). 

NOTE 9 A survey of the use of Latin characters in various languages is included in Annex D. 

NOTE 10 Use of the following characters: LATIN CAPITAL LETTER L WITH MIDDLE DOT, LATIN SMALL 

LETTER L WITH MIDDLE DOT and LATIN SMALL LETTER N PRECEDED BY APOSTROPHE, is deprecated. 

8 Specification of the coded character set 

8.1 Character sets 

The coded representations of the graphic characters of the repertoire defined in this International Standard make 
use of the character SPACE and of two character sets, that is "a primary set" and a "supplementary set". 

The primary set shall consist of the graphic characters of the basic GO set of ISO/IEC 10367, represented by bit 
combinations 02/01 to 07/14. The characters of the primary set shall not be used in combination with each other 
to generate graphic characters of the repertoire defined in this International Standard. The primary set contains the 
letters of the basic Latin alphabet, some spacing diacritical marks and a number of non-alphabetic characters. 

The supplementary set contains graphic characters, represented by bit combinations 10/00 to 11/15 and 13/00 to 
15/15, and non-spacing diacritical marks, represented by bit combinations 12/00 to 12/15. The graphic characters 
consist of a number of characters used in addition to those in the primary set. 

A non-spacing diacritical mark shall be used only in combination with certain basic Latin letters, or with SPACE. 
The allowed combinations of non-spacing diacritical marks and letters are the ones needed to represent the 
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accented letters included in table 4. This set of combinations is summarized in Annex C. 


The code table for the primary and the supplementary sets of graphic characters is given in table 1 . Shaded 
positions denote bit combinations which shall not be used. 

The names of the characters in the primary set are specified in Table 2. 

The names of the characters and non-spacing diacritical marks of the supplementary set are specified in Table 3. 
In order to stress that non-spacing diacritical marks are not characters, the names given to them are printed in 
lower case italics. 

8.2 Explanations concerning the code table 

8 . 2.1 Bit combinations 10/04 and 10/06 are reserved for future standardization, and shall not be used. 

8 . 2.2 The non-spacing diacritical marks of column 12 are used only in combination with certain basic Latin letters, 
or with SPACE (see Annex C). The graphic symbols shown in coloumn 12 represent diacritical marks as separate 
graphic characters. 

8.2.3 Bit combinations 12/00, 12/09 and 12/12 are reserved for possible allocation of additional diacritical marks, 
and shall not be used. 

8 . 2.4 Bit combinations 13/08 to 13/11 and 14/05 are reserved for future standardization, and shall not be used. 

8.3 Coded representations of the graphic characters of the repertoire 

The coded representations of the graphic characters of the repertoire defined in this International Standard are 
specified in table 4. The formats of the coded representations are as follows: 

a) Accented letters 

Each accented letter is represented by a sequence of bit combinations consisting of the coded 
representation of the relevant non-spacing diacritical mark (an element of the supplementary set), 
followed by the coded representation of the relevant basic Latin letter (an element of the primary 
set). 

b) Diacritical marks as separate graphic characters 

The diacritical marks that are elements of the primary set (GRAVE ACCENT, CIRCUMFLEX ACCENT and 
TILDE) are represented as separate qraphic characters by the corresponding single bit combination in the 
range 02/01 to 07/14. 

The other ten of the diacritical marks of column 12 are represented as separate graphic characters by a 
sequence of bit combinations consisting of the coded representation of the relevant non-spacing diacritical 
mark (an element of the supplementary set), followed by the coded representation of the character SPACE, 
i.e. the bit combination 02/00. 

c) All other graphic characters of the repertoire 

Any graphic character of the repertoire, other than an accented letter or a diacritical mark as a 
separate graphic character that is not an element of the primary set, is an element of either the 
primary set or the supplementary set and is represented by the corresponding single bit 
combination in the range 02/01 to 07/14 or 10/00 to 15/15. 

Depending of the code extension techniques used, a bit combination, representing an element of either the primary 
or the supplementary set may have to be preceded by a code extension function invoking the character set 
concerned. 

NOTES Explanations concerning certain letters: 
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NOTE 1 1 Accented letter LATIN SMALL LETTER G WITH CEDILLA was named "small g with acute accent" 
in the 1983 edition of this International Standard. For compatibility purposes, the coded representation has been 
kept unchanged. The name has been aligned with that in ISO/IEC 10646-1 . The cedilla, upturned, is placed above 
"g" for presentation purposes. 

NOTE 12 There is no LATIN CAPITAL LETTER ETH in this International Standard. There is a letter named 
LATIN CAPITAL LETTER D WITH STROKE which will also serve as the capital form of Icelandic Eth, where this 
International Standard is used. It should be noted that ISO/IEC 10646, ISO/IEC 8859-1 and ISO/IEC 10367 provide 
for a LATIN CAPITAL LETTER ETH as well as a LATIN CAPITAL LETTER D WITH STROKE. 


9 Graphic character subrepertoires 

The purpose of defining character subrepertoires is to facilitate communication with equipment capable of 
presenting text using a limited set of graphic characters at one time. An example of equipment that might make 
use of subrepertoires is a text communication terminal containing an output device that has a changeable printing 
element (physical or other). However, in order to comply with the requirements of this International Standard, such 
a text communication terminal has to be capable of receiving and presenting all graphic characters of the repertoire 
in some manner, possibly using one or more alternative printing elements. 

Subrepertoires are defined in accordance with the following rules: 

a) A subrepertoire shall include the character SPACE, the 26 Latin unaccented small letters and the 26 Latin 
unaccented capital letters. 

b) A subrepertoire shall include the 10 digits. 

c) A subrepertoire shall include the following characters: 

Graphic symbol Name 

’ APOSTROPHE 

( LEFT PARENTHESIS 

) RIGHT PARENTHESIS 

, COMMA 

HYPHEN-MINUS 
FULL STOP 

/ SOLIDUS 

: COLON 

? QUESTION MARK 

+ PLUS SIGN 

= EQUALS SIGN 

d) A subrepertoire may include any other graphic characters of the repertoire defined in this International Standard. 

e) A subrepertoire shall not include any character not defined in this International Standard. 

f) Two or more graphic characters of the repertoire shall not be included as a single character in the subrepertoire. 
The procedure for registration of subrepertoires is specified in ISO/IEC 7350. 

The identifier assigned to a registered subrepertoire is intended to be used as a parameter value of the control 
function IDENTIFY GRAPHIC SUBREPERTOIRE (IGS) which is defined in ISO/IEC 10538. 


10 Identification of options 

10.1 Purpose and context of identification 

CC-data-elements conforming to an option of this International Standard are intended to form all or part of a 
composite unit of coded information that is interchanged between a sender and a recipient. The identification of 
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the options of this International Standard that have been adopted by the originator shall also be available to the 
recipient. The route by which such identification is communicated to the recipient is outside the scope of this 
International Standard. 

However, some standards for interchange of coded information may permit, or require, that the coded 
representation of the identification applicable to the CC-data-elements forms part of the interchanged information. 
This clause specifies a coded representation for the identification of options of this International Standard. Such 
coded representations form all or part of an identifying data element, which may be included in information 
interchange in accordance with the relevant standard. 

10.2 Identification of coding method 

The coding method adopted shall be identified by means of one of the following announcer sequences: 

ESC 02/00 04/10 shall identify 7-bit coding (as in Annex A); 

ESC 02/00 04/1 1 shall identify 8-bit coding. 


10.3 Identification of primary and supplementary sets 


The escape sequences used to designate the primary and the supplementary sets are: 


ESC 02/08 04/02 
ESC 02/13 05/02 
ESC 02/14 05/02 
ESC 02/15 05/02 


to designate the primary set of the present edition of this 
International Standard as the GO set (ISO-IR 6); 
to designate the supplementary set of the present edition of 
this International Standard as the G1 set (ISO-IR 156); 
to designate the supplementary set of the present edition of 
this International Standard as the G2 set; 
to designate the supplementary set of the present edition of 
this International Standard as the G3 set. 


NOTE 13 The escape sequences used to designate the primary and the supplementary sets of ISO 6937/2:1983 


are: 

ESC 02/08 04/00 
ESC 02/09 06/12 

ESC 02/10 06/12 
ESC 02/11 06/12 


to designate the primary set as the GO set (ISO-IR 2); 
to designate the supplementary set as the G1 set (ISO-IR 
90); 

to designate the supplementary set as the G2 set; 
to designate the supplementary set as the G3 set. 


10.4 Identification of subrepertoire 


The subrepertoire adopted shall be identified by the control function IDENTIFY GRAPHIC SUBREPERTOIRE (IGS) 
which is defined in ISO/IEC 10538. Parameter values identifying graphic character subrepertoires are registered 
in accordance with ISO/IEC 7350. 
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Table 1 - Primary and supplementary sets of graphic characters and non-spacing diacritical marks for 

text communication 

(coding when represented by bit combinations 02/01 to 07/14 and 10/00 to 15/15 of an 8-bit code) 


00 01 02 03 04 05 06 07 08 09 10 11 12 13 14 15 
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Table 2 - Specification of the primary character set in an 8-bit code 


Bit 

comb. 

Name 

Bit 

comb. 

Name 



05/00 

LATIN CAPITAL LETTER P 

02/01 

EXCLAMATION MARK 

05/01 

LATIN CAPITAL LETTER Q 

02/02 

QUOTATION MARK 

05/02 

LATIN CAPITAL LETTER R 

02/03 

NUMBER SIGN 

05/03 

LATIN CAPITAL LETTER S 

02/04 

DOLLAR SIGN 

05/04 

LATIN CAPITAL LETTER T 

02/05 

PERCENT SIGN 

05/05 

LATIN CAPITAL LETTER U 

02/06 

AMPERSAND 

05/06 

LATIN CAPITAL LETTER V 

02/07 

APOSTROPHE 

05/07 

LATIN CAPITAL LETTER W 

02/08 

LEFT PARENTHESIS 

05/08 

LATIN CAPITAL LETTER X 

02/09 

RIGHT PARENTHESIS 

05/09 

LATIN CAPITAL LETTER Y 

02/10 

ASTERISK 

05/10 

LATIN CAPITAL LETTER Z 

02/11 

PLUS SIGN 

05/11 

LEFT SQUARE BRACKET 

02/12 

COMMA 

05/12 

REVERSE SOLIDUS 

02/13 

HYPHEN-MINUS 

05/13 

RIGHT SQUARE BRACKET 

02/14 

FULL STOP 

05/14 

CIRCUMFLEX ACCENT 

02/15 

SOLIDUS 

05/15 

LOW LINE 





03/00 

DIGIT ZERO 

06/00 

GRAVE ACCENT 

03/01 

DIGIT ONE 

06/01 

LATIN SMALL LETTER A 

03/02 

DIGIT TWO 

06/02 

LATIN SMALL LETTER B 

03/03 

DIGIT THREE 

06/03 

LATIN SMALL LETTER C 

03/04 

DIGIT FOUR 

06/04 

LATIN SMALL LETTER D 

03/05 

DIGIT FIVE 

06/05 

LATIN SMALL LETTER E 

03/06 

DIGIT SIX 

06/06 

LATIN SMALL LETTER F 

03/07 

DIGIT SEVEN 

06/07 

LATIN SMALL LETTER G 

03/08 

DIGIT EIGHT 

06/08 

LATIN SMALL LETTER H 

03/09 

DIGIT NINE 

06/09 

LATIN SMALL LETTER 1 

03/10 

COLON 

06/10 

LATIN SMALL LETTER J 

03/11 

SEMICOLON 

06/11 

LATIN SMALL LETTER K 

03/12 

LESS-THAN SIGN 

06/12 

LATIN SMALL LETTER L 

03/13 

EQUALS SIGN 

06/13 

LATIN SMALL LETTER M 

03/14 

GREATER-THAN SIGN 

06/14 

LATIN SMALL LETTER N 

03/15 

QUESTION MARK 

06/15 

LATIN SMALL LETTER O 





04/00 

COMMERCIAL AT 

07/00 

LATIN SMALL LETTER P 

04/01 

LATIN CAPITAL LETTER A 

07/01 

LATIN SMALL LETTER Q 

04/02 

LATIN CAPITAL LETTER B 

07/02 

LATIN SMALL LETTER R 

04/03 

LATIN CAPITAL LETTER C 

07/03 

LATIN SMALL LETTER S 

04/04 

LATIN CAPITAL LETTER D 

07/04 

LATIN SMALL LETTER T 

04/05 

LATIN CAPITAL LETTER E 

07/05 

LATIN SMALL LETTER U 

04/06 

LATIN CAPITAL LETTER F 

07/06 

LATIN SMALL LETTER V 

04/07 

LATIN CAPITAL LETTER G 

07/07 

LATIN SMALL LETTER W 

04/08 

LATIN CAPITAL LETTER H 

07/08 

LATIN SMALL LETTER X 

04/09 

LATIN CAPITAL LETTER 1 

07/09 

LATIN SMALL LETTER Y 

04/10 

LATIN CAPITAL LETTER J 

07/10 

LATIN SMALL LETTER Z 

04/11 

LATIN CAPITAL LETTER K 

07/11 

LEFT CURLY BRACKET 

04/12 

LATIN CAPITAL LETTER L 

07/12 

VERTICAL LINE 

04/13 

LATIN CAPITAL LETTER M 

07/13 

RIGHT CURLY BRACKET 

04/14 

LATIN CAPITAL LETTER N 

07/14 

TILDE 

04/15 

LATIN CAPITAL LETTER O 






Table 3 - Specification of the supplementary character set in an 8-bit code 


Bit 

comb. 

Name 

Bit 

comb. 

Name 

10/00 

NO-BREAK SPACE 

13/00 

HORIZONTAL BAR 


iSSSSS 


This position shall not be used 


his position shall not be used 




r is* Tit ■ 




LEFT DOUBLE QUOTATION MARK 


EFT-POINTING DOUBLE AN 
QUOTATION MARK 


I ■ a a iV/J :l »TcTJ :l :t»Wi 


UPWARDS ARROW 


RIGHTWARDS ARROW 


mumBvmtmmamii 


DEGREE SIGN 


liSB 

■Ktuaa:w»t:iiaai;i:ia 


MULTIPLICATION SIGN 


MICR 


DIVISION SIGN 



BROKEN BAR 


This position shall not be used 




This position shall not be used 


This position shall not be used 


MiirtyjsiasgwiiwiiKWiiiaawiiii 


VULGAR FRACTION THREE EIGHTHS 


>v.ncT J: ia ; rjHiw^d^=«arctiii:« 

EVEiaasmmmmiMmm 


OHM SIGN 


UMMSiwasaMmamiEjam 


LATIN CAPITAL LETTER H WITH STROKE 


This position shall not be use 



imaMmaaasmmma^m 


QUOTATION MARK 


smMsmmsomas^mmm 


LATIN CAPITAL LETTER L WITH STROKE 









(This position shall not be used) 


non-spacing grave accent 


non-spacing grave accent 


non-spacing circumflex accent 


non-spacing tilde 


non-spacing macron 


non-spacing breve 


non-spacing dot above 


non-spacing diaeresis 


is position shall not be use 


non -spacing ring above 


non-spacing cedilla 


(This position shall not be used) 


non-spacing double acute accent 


IHE 



LATIN SMALL LETTER N PRECEDED B 
APOSTROPHE 


LATIN SMALL LETTER KRA 


LATIN SMALL LETTER AE 


■MiiUBitHWiaauaaiir 



LATIN SMALL LETTER H WITH STROKE 




LATIN SMALL LETTER L WITH STROKE 


■RMisia^iBiaiia:wa:rj:fi 


LATIN SMALL LETTER THORN 


LATIN SMALL LETTER T WITH STROKE 
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Table 4 - Specification of the repertoire 


Name 





AMPERSAND 


02/06 

APOSTROPHE 


02/07 

ASTERISK 


02/10 

BREVE 

12/06 

02/00 

BROKEN BAR 

13/07 


CARON 

12/15 

02/00 

CEDILLA 

12/11 

02/00 

CENT SIGN 

10/02 


CIRCUMFLEX ACCENT 


05/14 

COLON 


03/10 

COMMA 


02/12 

COMMERCIAL AT 


04/00 

COPYRIGHT SIGN 

13/03 


CURRENCY SIGN 

10/08 


DEGREE SIGN 

11/00 


DIAERESIS 

12/08 

02/00 

DIGIT EIGHT 


03/08 

DIGIT FIVE 


03/05 

DIGIT FOUR 


03/04 

DIGIT NINE 


03/09 

DIGIT ONE 


03/01 

DIGIT SEVEN 


03/07 

DIGIT SIX 


03/06 

DIGIT THREE 


03/03 

DIGIT TWO 


03/02 

DIGIT ZERO 


03/00 

DIVISION SIGN 

11/08 


DOLLAR SIGN 


02/04 

DOT ABOVE 

12/07 

02/00 

DOUBLE ACUTE ACCENT 

12/13 

02/00 

DOWNWARDS ARROW 

10/15 


EQUALS SIGN 


03/13 

EXCLAMATION MARK 


02/01 

FEMININE ORDINAL INDICATOR 

14/03 


FULL STOP 


02/14 

GRAVE ACCENT 


06/00 

GREATER-THAN SIGN 


03/14 

HORIZONTAL BAR 

13/00 


HYPHEN-MINUS 


02/13 

INVERTED EXCLAMATION MARK 

10/01 


INVERTED QUESTION MARK 

11/15 


LATIN CAPITAL LETTER A 


04/01 

LATIN CAPITAL LETTER A WITH ACUTE 

12/02 

04/01 

LATIN CAPITAL LETTER A WITH BREVE 

12/06 

04/01 

LATIN CAPITAL LETTER A WITH CIRCUMFLEX 

12/03 

04/01 

LATIN CAPITAL LETTER A WITH DIAERESIS 

12/08 

04/01 

LATIN CAPITAL LETTER A WITH GRAVE 

12/01 

04/01 

LATIN CAPITAL LETTER A WITH MACRON 

12/05 

04/01 

LATIN CAPITAL LETTER A WITH OGONEK 

12/14 

04/01 

LATIN CAPITAL LETTER A WITH RING ABOVE 

12/10 

04/01 
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Table 4 - (continued) 


Name 

Coded representation 

LATIN CAPITAL LETTER A WITH TILDE 

12/04 

04/01 

LATIN CAPITAL LETTER AE 1 

14/01 


LATIN CAPITAL LETTER B 


04/02 

LATIN CAPITAL LETTER C 


04/03 

LATIN CAPITAL LETTER C WITH ACUTE 

12/02 

04/03 

LATIN CAPITAL LETTER C WITH CARON 

12/15 

04/03 

LATIN CAPITAL LETTER C WITH CEDILLA 

12/11 

04/03 

LATIN CAPITAL LETTER C WITH CIRCUMFLEX 

12/03 

04/03 

LATIN CAPITAL LETTER C WITH DOT ABOVE 

12/07 

04/03 

LATIN CAPITAL LETTER D 


04/04 

LATIN CAPITAL LETTER D WITH CARON 

12/15 

04/04 

LATIN CAPITAL LETTER D WITH STROKE 

14/02 


LATIN CAPITAL LETTER E 


04/05 

LATIN CAPITAL LETTER E WITH ACUTE 

12/02 i 

04/05 

LATIN CAPITAL LETTER E WITH CARON 

12/15 

04/05 

LATIN CAPITAL LETTER E WITH CIRCUMFLEX 

12/03 

04/05 

LATIN CAPITAL LETTER E WITH DIAERESIS 

12/08 

04/05 

LATIN CAPITAL LETTER E WITH DOT ABOVE 

12/07 

04/05 

LATIN CAPITAL LETTER E WITH GRAVE 

12/01 

04/05 

LATIN CAPITAL LETTER E WITH MACRON 

12/05 

04/05 

LATIN CAPITAL LETTER E WITH OGONEK 

12/14 

04/05 

LATIN CAPITAL LETTER ENG 

14/14 


LATIN CAPITAL LETTER F 


04/06 

LATIN CAPITAL LETTER G 


04/07 

LATIN CAPITAL LETTER G WITH BREVE 

12/06 

04/07 

LATIN CAPITAL LETTER G WITH CEDILLA 

12/11 

04/07 

LATIN CAPITAL LETTER G WITH CIRCUMFLEX 

12/03 

04/07 

LATIN CAPITAL LETTER G WITH DOT ABOVE 

12/07 

04/07 

LATIN CAPITAL LETTER H 


04/08 

LATIN CAPITAL LETTER H WITH CIRCUMFLEX 

12/03 

04/08 

LATIN CAPITAL LETTER H WITH STROKE 

14/04 


LATIN CAPITAL LETTER 1 


04/09 

LATIN CAPITAL LETTER 1 WITH ACUTE 

12/02 

04/09 

LATIN CAPITAL LETTER 1 WITH CIRCUMFLEX 

12/03 

04/09 

LATIN CAPITAL LETTER 1 WITH DIAERESIS 

12/08 

04/09 

LATIN CAPITAL LETTER 1 WITH DOT ABOVE 

12/07 

04/09 

LATIN CAPITAL LETTER 1 WITH GRAVE 

12/01 

04/09 

LATIN CAPITAL LETTER 1 WITH MACRON 

12/05 

04/09 

LATIN CAPITAL LETTER 1 WITH OGONEK 

12/14 

04/09 

LATIN CAPITAL LETTER 1 WITH TILDE 

12/04 

04/09 

LATIN CAPITAL LETTER J 


04/10 

LATIN CAPITAL LETTER J WITH CIRCUMFLEX 

12/03 

04/10 

LATIN CAPITAL LETTER K 


04/11 

LATIN CAPITAL LETTER K WITH CEDILLA 

12/11 

04/11 

LATIN CAPITAL LETTER L 


04/12 

LATIN CAPITAL LETTER L WITH ACUTE 

12/02 

04/12 

LATIN CAPITAL LETTER L WITH CARON 

12/15 

04/12 

LATIN CAPITAL LETTER L WITH CEDILLA 

12/11 

04/12 

LATIN CAPITAL LETTER L WITH MIDDLE DOT 

14/07 


LATIN CAPITAL LETTER L WITH STROKE 

14/08 


LATIN CAPITAL LETTER M 


04/13 

NOTE 1 This letter was named LATIN CAPITAL LIGATURE A E in the 1994 edition of 

this International Standard. The name has been aligned with that in ISO/IEC 10646-1. 




Table 4 - (continued) 




I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
I CAPITAL 
CAPITAL 
I CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 
CAPITAL 


LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 

LETTER 


N WITH 
N WITH 
N WITH 
N WITH 
O 

O WITH 
O WITH 
O WITH 
O WITH 
O WITH 
O WITH 
O WITH 
O WITH 
P 
Q 
R 

R WITH 
R WITH 
R WITH 
S 

S WITH 
S WITH 
S WITH 
S WITH 
T 

T WITH 
T WITH 
T WITH 
THORN 
U 

U WITH 
U WITH 
U WITH 
U WITH 
U WITH 
U WITH 
U WITH 
U WITH 
U WITH 
U WITH 

Y 
W 

W WITH 
X 

Y 

Y WITH 

Y WITH 

Y WITH 
Z 


ACUTE 

CARON 

CEDILLA 

TILDE 

ACUTE 

CIRCUMFLEX 

DIAERESIS 

DOUBLE ACUTE 

GRAVE 

MACRON 

STROKE 

TILDE 


ACUTE 

CARON 

CEDILLA 

ACUTE 

CARON 

CEDILLA 

CIRCUMFLEX 

CARON 

CEDILLA 

STROKE 


ACUTE 

BREVE 

CIRCUMFLEX 

DIAERESIS 

DOUBLE ACUTE 

GRAVE 

MACRON 

OGONEK 

RING ABOVE 

TILDE 


CIRCUMFLEX 


ACUTE 

CIRCUMFLEX 

DIAERESIS 




14 






Table 4 - (continued) 


Name 

Coded representation 

LATIN CAPITAL LETTER Z WITH ACUTE 

12/02 

05/10 

LATIN CAPITAL LETTER Z WITH CARON 

12/15 

05/10 

LATIN CAPITAL LETTER Z WITH DOT ABOVE 

12/07 

05/10 

LATIN CAPITAL LIGATURE IJ 2 

14/06 


LATIN CAPITAL LIGATURE OE 2 

14/10 


LATIN SMALL LETTER A 


06/01 

LATIN SMALL LETTER A WITH ACUTE 

12/02 

06/01 

LATIN SMALL LETTER A WITH BREVE 

12/06 

06/01 

LATIN SMALL LETTER A WITH CIRCUMFLEX 

12/03 

06/01 

LATIN SMALL LETTER A WITH DIAERESIS 

12/08 

06/01 

LATIN SMALL LETTER A WITH GRAVE 

12/01 

06/01 

LATIN SMALL LETTER A WITH MACRON 

12/05 

06/01 

LATIN SMALL LETTER A WITH OGONEK 

12/14 

06/01 

LATIN SMALL LETTER A WITH RING ABOVE 

12/10 

06/01 

LATIN SMALL LETTER A WITH TILDE 

12/04 

06/01 

LATIN SMALL LETTER AE 3 

15/01 


LATIN SMALL LETTER B 


06/02 

LATIN SMALL LETTER C 


06/03 

LATIN SMALL LETTER C WITH ACUTE 

12/02 

06/03 

LATIN SMALL LETTER C WITH CARON 

12/15 

06/03 

LATIN SMALL LETTER C WITH CEDILLA 

12/11 

06/03 

LATIN SMALL LETTER C WITH CIRCUMFLEX 

12/03 

06/03 

LATIN SMALL LETTER C WITH DOT ABOVE 

12/07 

06/03 

LATIN SMALL LETTER D 


06/04 

LATIN SMALL LETTER D WITH CARON 

12/15 

06/04 

LATIN SMALL LETTER D WITH STROKE 

15/02 


LATIN SMALL LETTER DOTLESS 1 

15/05 


LATIN SMALL LETTER E 


06/05 

LATIN SMALL LETTER E WITH ACUTE 

12/02 

06/05 

LATIN SMALL LETTER E WITH CARON 

12/15 

06/05 

LATIN SMALL LETTER E WITH CIRCUMFLEX 

12/03 

06/05 

LATIN SMALL LETTER E WITH DIAERESIS 

12/08 

06/05 

LATIN SMALL LETTER E WITH DOT ABOVE 

12/07 

06/05 

LATIN SMALL LETTER E WITH GRAVE 

12/01 

06/05 

LATIN SMALL LETTER E WITH MACRON 

12/05 

06/05 

LATIN SMALL LETTER E WITH OGONEK 

12/14 

06/05 

LATIN SMALL LETTER ENG 

15/14 


LATIN SMALL LETTER ETH 

15/03 


LATIN SMALL LETTER F 


06/06 

LATIN SMALL LETTER G 


06/07 

LATIN SMALL LETTER G WITH BREVE 

12/06 

06/07 


NOTE 2 In the Dutch language, LATIN CAPITAL LIGATURE IJ is considered as a letter, and 
in the French language LATIN CAPITAL LIGATURE OE is considered a letter. 


NOTE 3 This letter was named LATIN SMALL LIGATURE A E in the 1994 edition of this 
International Standard. The name has been aligned with that in ISO/I EC 10646-1. 




Table 4 - (continued) 


Name 

Coded representation 

LATIN SMALL LETTER Q WITH CEDILLA 4 

12/02 

06/07 

LATIN SMALL LETTER G WITH CIRCUMFLEX 

12/03 

06/07 

LATIN SMALL LETTER G WITH DOT ABOVE 

12/07 

06/07 

LATIN SMALL LETTER H 


06/08 

LATIN SMALL LETTER H WITH CIRCUMFLEX 

12/03 

06/08 

LATIN SMALL LETTER H WITH STROKE 

15/04 


LATIN SMALL LETTER 1 


06/09 

LATIN SMALL LETTER 1 WITH ACUTE 

12/02 

06/09 

LATIN SMALL LETTER 1 WITH CIRCUMFLEX 

12/03 

06/09 

LATIN SMALL LETTER 1 WITH DIAERESIS 

12/08 

06/09 

LATIN SMALL LETTER 1 WITH GRAVE 

12/01 

06/09 

LATIN SMALL LETTER 1 WITH MACRON 

12/05 

06/09 

LATIN SMALL LETTER 1 WITH OGONEK 

12/14 

06/09 

LATIN SMALL LETTER 1 WITH TILDE 

12/04 

06/09 

LATIN SMALL LETTER J 


06/10 

LATIN SMALL LETTER J WITH CIRCUMFLEX 

12/03 

06/10 

LATIN SMALL LETTER K 


06/11 

LATIN SMALL LETTER K WITH CEDILLA 

12/11 

06/11 

LATIN SMALL LETTER KRA 

15/00 


LATIN SMALL LETTER L 


06/12 

LATIN SMALL LETTER L WITH ACUTE 

12/02 

06/12 

LATIN SMALL LETTER L WITH CARON 

12/15 

06/12 

LATIN SMALL LETTER L WITH CEDILLA 

12/11 

06/12 

LATIN SMALL LETTER L WITH MIDDLE DOT 

15/07 


LATIN SMALL LETTER L WITH STROKE 

15/08 


LATIN SMALL LETTER M 


06/13 

LATIN SMALL LETTER N 


06/14 

LATIN SMALL LETTER N PRECEDED BY APOSTROPHE 

14/15 


LATIN SMALL LETTER N WITH ACUTE 

12/02 

06/14 

LATIN SMALL LETTER N WITH CARON 

12/15 

06/14 

LATIN SMALL LETTER N WITH CEDILLA 

12/11 

06/14 

LATIN SMALL LETTER N WITH TILDE 

12/04 

06/14 

LATIN SMALL LETTER O 


06/15 

LATIN SMALL LETTER 0 WITH ACUTE 

12/02 

06/15 

LATIN SMALL LETTER 0 WITH CIRCUMFLEX 

12/03 

06/15 

LATIN SMALL LETTER 0 WITH DIAERESIS 

12/08 

06/15 

LATIN SMALL LETTER 0 WITH DOUBLE ACUTE 

12/13 

06/15 

LATIN SMALL LETTER 0 WITH GRAVE 

12/01 

06/15 

LATIN SMALL LETTER O WITH MACRON 

12/05 

06/15 

LATIN SMALL LETTER O WITH STROKE 

15/09 


LATIN SMALL LETTER O WITH TILDE 

12/04 

06/15 

LATIN SMALL LETTER P 


07/00 

LATIN SMALL LETTER Q 


07/01 

NOTE 4 Accented letter LATIN SMALL LETTER G WITH CEDILLA was named "small g with 

acute accent" in the 1983 edition of this International Standard. For compatibility purposes, the 

coded representation has been kept unchanged. The name has been aligned with that in ISO/IEC 

10646-1. 
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Table 4 - (continued) 


Name 

Coded representation 

LATIN SMALL LETTER R 


07/02 

LATIN SMALL LETTER R WITH ACUTE 

12/02 

07/02 

LATIN SMALL LETTER R WITH CARON 

12/15 

07/02 

LATIN SMALL LETTER R WITH CEDILLA 

12/11 

07/02 

LATIN SMALL LETTER S 


07/03 

LATIN SMALL LETTER S WITH ACUTE 

12/02 

07/03 

LATIN SMALL LETTER S WITH CARON 

12/15 

07/03 

LATIN SMALL LETTER S WITH CEDILLA 

12/11 

07/03 

LATIN SMALL LETTER S WITH CIRCUMFLEX 

12/03 

07/03 

LATIN SMALL LETTER SHARP S 

15/11 


LATIN SMALL LETTER T 


07/04 

LATIN SMALL LETTER T WITH CARON 

12/15 

07/04 

LATIN SMALL LETTER T WITH CEDILLA 

12/11 

07/04 

LATIN SMALL LETTER T WITH STROKE 

15/13 


LATIN SMALL LETTER THORN 

15/12 


LATIN SMALL LETTER U 


07/05 

LATIN SMALL LETTER U WITH ACUTE 

12/02 

07/05 

LATIN SMALL LETTER U WITH BREVE 

12/06 

07/05 

LATIN SMALL LETTER U WITH CIRCUMFLEX 

12/03 

07/05 

LATIN SMALL LETTER U WITH DIAERESIS 

12/08 

07/05 

LATIN SMALL LETTER U WITH DOUBLE ACUTE 

12/13 

07/05 

LATIN SMALL LETTER U WITH GRAVE 

12/01 

07/05 

LATIN SMALL LETTER U WITH MACRON 

12/05 

07/05 

LATIN SMALL LETTER U WITH OGONEK 

12/14 

07/05 

LATIN SMALL LETTER U WITH RING ABOVE 

12/10 

07/05 

LATIN SMALL LETTER U WITH TILDE 

12/04 

07/05 

LATIN SMALL LETTER V 


07/06 

LATIN SMALL LETTER W 


07/07 

LATIN SMALL LETTER W WITH CIRCUMFLEX 

12/03 

07/07 

LATIN SMALL LETTER X 


07/08 

LATIN SMALL LETTER Y 


07/09 

LATIN SMALL LETTER Y WITH ACUTE 

12/02 

07/09 

LATIN SMALL LETTER Y WITH CIRCUMFLEX 

12/03 

07/09 

LATIN SMALL LETTER Y WITH DIAERESIS 

12/08 

07/09 

LATIN SMALL LETTER Z 


07/10 

LATIN SMALL LETTER Z WITH ACUTE 

12/02 

07/10 

LATIN SMALL LETTER Z WITH CARON 

12/15 

07/10 

LATIN SMALL LETTER Z WITH DOT ABOVE 

12/07 

07/10 

LATIN SMALL LIGATURE IJ 5 

15/06 


LATIN SMALL LIGATURE OE 5 

15/10 


LEFT CURLY BRACKET 


07/11 

LEFT DOUBLE QUOTATION MARK 

10/10 


NOTE 5 In the Dutch language, LATIN SMALL LIGATURE IJ is considered as a letter, and in the 

French language LATIN SMALL LIGATURE OE is considered a letter. 





Table 4 - (concluded) 


Name 


LEFT PARENTHESIS 


02/08 

LEFT-POINTING DOUBLE ANGLE QUOTATION MARK 

10/11 


LEFT SINGLE QUOTATION MARK 

10/09 


LEFT SQUARE BRACKET 


05/11 

LEFTWARDS ARROW 

10/12 


LESS-THAN SIGN 


03/12 

LOW LINE 


05/15 

MACRON 

12/05 

02/00 

MASCULINE ORDINAL INDICATOR 

14/11 


MICRO SIGN 

11/05 


MIDDLE DOT 

11/07 


MULTIPLICATION SIGN 

11/04 


EIGHTH NOTE 

13/05 


NO-BREAK SPACE 

10/00 


NOT SIGN 

13/06 


NUMBER SIGN 


02/03 

OGONEK 

12/14 

02/00 

OHM SIGN 

14/00 


PERCENT SIGN 


02/05 

PILCROW SIGN 

11/06 


PLUS SIGN 


02/11 

PLUS-MINUS SIGN 

11/01 


POUND SIGN 

10/03 


QUESTION MARK 


03/15 

QUOTATION MARK 


02/02 

REGISTERED SIGN 

13/02 


REVERSE SOLIDUS 


05/12 

RIGHT CURLY BRACKET 


07/13 

RIGHT DOUBLE QUOTATION MARK 

11/10 


RIGHT PARENTHESIS 


02/09 

RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK 

11/11 


RIGHT SINGLE QUOTATION MARK 

11/09 


RIGHT SQUARE BRACKET 


05/13 

RIGHTWARDS ARROW 

10/14 


RING ABOVE 

12/10 

02/00 

SECTION SIGN 

10/07 


SEMICOLON 


03/11 

SOFT HYPHEN 

15/15 


SOLIDUS 


02/15 

SPACE 

02/00 


SUPERSCRIPT ONE 

13/01 


SUPERSCRIPT THREE 

11/03 


SUPERSCRIPT TWO 

11/02 


TILDE 


07/14 

TRADE MARK SIGN 

13/04 


UPWARDS ARROW 

10/13 


VERTICAL LINE 


07/12 

VULGAR FRACTION FIVE EIGHTHS 

13/14 


VULGAR FRACTION ONE EIGHTH 

13/12 


VULGAR FRACTION ONE HALF 

11/13 


VULGAR FRACTION ONE QUARTER 

11/12 


VULGAR FRACTION SEVEN EIGHTHS 

13/15 


VULGAR FRACTION THREE EIGHTHS 

13/13 


VULGAR FRACTION THREE QUARTERS 

11/14 


YEN SIGN 

10/05 
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Annex A 

(normative) 

7-bit code 


This Annex specifies the 7-bit code for the character sets of this International Standard. 


Notation (see 5.1): The bits of the bit combinations of the 7-bit code are identified by by, bg, bg, b 4 , bg, b 2 and 
b-j, where by is the highest-order, or most significant bit and b 1 is the lowest-order, or least significant bit. 

The bit combinations may be interpreted to represent numbers in the range 0 to 127 in binary notation by attributing 
the following weights to the individual bits: 


Bit 

m 

m 

Hi 

m 


b 2 

b 1 

Weight 

64 

32 

16 

8 

4 

2 

1 


In this International Standard, the bit combinations are identified by notations of the form xx/yy, where xx is a 
number in the range 00 to 07 and yy a number in the range 00 to 15. The correspondence between the notations 
of the form xx/yy and the bit combinations consisting of the bits by to b^ , is as follows: 

- xx is the number represented by by, bg and bg where these bits are given the weights 4, 2 and 1 , respectively; 

- yy is the number represented by b 4 , bg, b 2 and b 1 where these bits are given the weights 8, 4, 2 and 1, 
respectively. 

The notations of the form xx/yy are the same as the ones used to identify code table positions, where xx is the 
column number and yy is the row number (see 5.2). 

Code table (see 5.2): A 7-bit code table consists of 128 positions arranged in 8 columns and 16 rows. The 
columns are numbered 00 to 07 and the rows are numbered 00 to 15. 

GO, G1, G2 and G3 sets: In a 7-bit code, the elements of a GO set are represented by bit combinations in the 
range 02/01 to 07/14, and the elements of a G1, G2 or G3 set of graphic characters are also represented by bit 
combinations in the range 02/00 to 07/15 after invocation by the appropriate code extension function in accordance 
with ISO 2022. 
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Annex B 

(informative) 


Method of definition of short identifiers of this International Standard 


Characters are identified by their names as specified in the repertoire. In certain applications, these names may 
be too long for referencing. To serve this situation, a system of short identifiers is introduced. 

NOTE 14 In the 1983 edition of this International Standard, these short identifiers were called "identifiers", and 
intended to identify characters. This practice is not continued in this International Standard, and is in fact 
deprecated. 


For the purpose of this International Standard, a method has been developed which allows for a short form of 
identification of graphic characters. The method is shown in figure B.l. 

Each short identifier consists of two capital letters and two digits. 

The first letter indicates an alphabet or a character category (in the case of a non-alphabetic graphic character). 
Only L, N and S are used in this Annex, the other capital letters are reserved for future use. 

The second letter indicates a letter of the alphabet or, in the case of a non-alphabetic graphic character, the group 
of characters. 


In the case of an alphabetic character, the first digit indicates the presence of a diacritical mark or a special form, 
and the second digit indicates whether it is a capital or a small letter. The digits have no special meaning when 
the short identifier begins with an N or an S. 

The numbering is used in a consistent manner so that each diacritical mark is always given the same number. 
The numbering principle is shown in figure B.2. 


Table B.l provides the lists of short identifiers and names for the graphic characters of the repertoire defined in 
this International Standard. 


NOTE 15: The following short identifiers have been changed from the second edition to the third edition: 


old new 
LA51 LA61 
LA52 LA62 
LG1 1 LG41 
LI51 LI63 
LI52 LI64 
L051 L063 
L052 L064 


character 

LATIN CAPITAL LETTER AE 

LATIN SMALL LETTER AE 

LATIN CAPITAL LETTER G WITH CEDILLA 

LATIN CAPITAL LIGATURE IJ 

LATIN SMALL LIGATURE IJ 

LATIN CAPITAL LIGATURE OE 

LATIN SMALL LIGATURE OE 


and the catogory LIGATURE has been removed from the method of definition of short identifiers. 
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A 0 1 


Por alphabetic characters: 
odd digit = small letter; 
even digit = capital letter. 

If N or S in first position: 
no special meaning. 


| For alphabetic characters: 

; 0 = letter without diacritical mark; 

; 1 to 3 = letter with diacrital mark above it; 

! 4 = letter with diacritical mark below it; 

i 6 = special form. 

I If N or S in first position: 

no special meaning. 

For alphabetic characters: 

A to Z = the respective letter of the Latin alphabet. 

If N in first position: 

D = digit; 

F = fraction; 

S = subscript or superscript. 

If S in first position: 

A = arithmetic sign; 

C = currency sign; 

D = diacritical mark; 

P = punctuation mark; 

M = other symbol (miscellaneous). 


JEor all graphic characters: 

L = Latin alphabetic character; 
N = numeric graphic character; 
S = special graphic character. 


Figure B.1 - Method of definition of short identifiers 



No diacritical mark 

Small 

01 

Capital 

02 

ACUTE ACCENT 

11 

12 

GRAVE ACCENT 

13 

14 

CIRCUMFLEX ACCENT 

15 

16 

DIAERESIS 

17 

18 

TILDE 

19 

20 

CARON 

21 

22 

BREVE 

23 

24 

DOUBLE ACUTE ACCENT 

25 

26 

RING ABOVE 

27 

28 

DOT ABOVE 

29 

30 

MACRON 

31 

32 

CEDILLA 

41 

42 

OGONEK 

43 

44 

Special forms: 

AE, D, H, L, T WITH STROKE 

61 

62 

DOTLESS 1 

61 

- 

0 WITH STROKE 

61 

62 

KRA 

61 

- 

ENG 

61 

62 

SHARP S 

61 

- 

ETH (see note 12 in clause 8.3) 

63 

62 

L WITH MIDDLE DOT 

63 

64 

N PRECEDED BY APOSTROPHE 

63 

- 

THORN 

63 

64 

IJ, OE 

63 

64 

Figure B.2 - Numbering principle for alphabetic characters 



Table B.1 - List of short identifiers for the repertoire in alphabetic order 

of character names 


ID 

Name 

SD11 

ACUTE ACCENT 

SM03 

AMPERSAND 

SP05 

APOSTROPHE 

SM04 

ASTERISK 

SD23 

BREVE 

SM65 

BROKEN BAR 

SD21 

CARON 

SD41 

CEDILLA 

SC04 

CENT SIGN 

SD15 

CIRCUMFLEX ACCENT 

SP13 

COLON 

SP08 

COMMA 

SM05 

COMMERCIAL AT 

SM52 

COPYRIGHT SIGN 

SC01 

CURRENCY SIGN 

SMI 9 

DEGREE SIGN 

SD17 

DIAERESIS 

ND08 

DIGIT EIGHT 

ND05 

DIGIT FIVE 

ND04 

DIGIT FOUR 

ND09 

DIGIT NINE 

ND01 

DIGIT ONE 

ND07 

DIGIT SEVEN 

ND06 

DIGIT SIX 

ND03 

DIGIT THREE 

ND02 

DIGIT TWO 

ND1 0 

DIGIT ZERO 

SA06 

DIVISION SIGN 

SC03 

DOLLAR SIGN 

SD29 

DOT ABOVE 

SD25 

DOUBLE ACUTE ACCENT 

SM93 

EIGHTH NOTE 

SM33 

DOWNWARDS ARROW 

SA04 

EQUALS SIGN 

SP02 

EXCLAMATION MARK 

SM21 

FEMININE ORDINAL INDICATOR 

SP11 

FULL STOP 

SD13 

GRAVE ACCENT 

SA05 

GREATER-THAN SIGN 

SM12 

HORIZONTAL BAR 

SP10 

HYPHEN-MINUS 

SP03 

INVERTED EXCLAMATION MARK 

SP16 

INVERTED QUESTION MARK 

LA02 

LATIN CAPITAL LETTER A 

LAI 2 

LATIN CAPITAL LETTER A WITH ACUTE 

LA24 

LATIN CAPITAL LETTER A WITH BREVE 

LAI 6 

LATIN CAPITAL LETTER A WITH CIRCUMFLEX 

LAI 8 

LATIN CAPITAL LETTER A WITH DIAERESIS 

LAI 4 

LATIN CAPITAL LETTER A WITH GRAVE 




Table B.1 - (continued) 


ID 

Name 

LA32 

LATIN CAPITAL LETTER A WITH MACRON 

LA44 

LATIN CAPITAL LETTER A WITH OGONEK 

LA28 

LATIN CAPITAL LETTER A WITH RING ABOVE 

LA20 

LATIN CAPITAL LETTER A WITH TILDE 

LA62 

LATIN CAPITAL LETTER AE 

LB02 

LATIN CAPITAL LETTER B 

LC02 

LATIN CAPITAL LETTER C 

LC12 

LATIN CAPITAL LETTER C WITH ACUTE 

LC22 

LATIN CAPITAL LETTER C WITH CARON 

LC42 

LATIN CAPITAL LETTER C WITH CEDILLA 

LC16 

LATIN CAPITAL LETTER C WITH CIRCUMFLEX 

LC30 

LATIN CAPITAL LETTER C WITH DOT ABOVE 

LD02 

LATIN CAPITAL LETTER D 

LD22 

LATIN CAPITAL LETTER D WITH CARON 

LD62 

LATIN CAPITAL LETTER D WITH STROKE 

LE02 

LATIN CAPITAL LETTER E 

LEI 2 

LATIN CAPITAL LETTER E WITH ACUTE 

LE22 

LATIN CAPITAL LETTER E WITH CARON 

LEI 6 

LATIN CAPITAL LETTER E WITH CIRCUMFLEX 

LEI 8 

LATIN CAPITAL LETTER E WITH DIAERESIS 

LE30 

LATIN CAPITAL LETTER E WITH DOT ABOVE 

LEU 

LATIN CAPITAL LETTER E WITH GRAVE 

LE32 

LATIN CAPITAL LETTER E WITH MACRON 

LE44 

LATIN CAPITAL LETTER E WITH OGONEK 

LN62 

LATIN CAPITAL LETTER ENG 

LF02 

LATIN CAPITAL LETTER F 

LG02 

LATIN CAPITAL LETTER G 

LG24 

LATIN CAPITAL LETTER G WITH BREVE 

LG42 

LATIN CAPITAL LETTER G WITH CEDILLA 

LG16 

LATIN CAPITAL LETTER G WITH CIRCUMFLEX 

LG30 

LATIN CAPITAL LETTER G WITH DOT ABOVE 

LH02 

LATIN CAPITAL LETTER H 

LH16 

LATIN CAPITAL LETTER H WITH CIRCUMFLEX 

LH62 

LATIN CAPITAL LETTER H WITH STROKE 

LI02 

LATIN CAPITAL LETTER 1 

L1 1 2 

LATIN CAPITAL LETTER 1 WITH ACUTE 

L1 1 6 

LATIN CAPITAL LETTER 1 WITH CIRCUMFLEX 

L1 1 8 

LATIN CAPITAL LETTER 1 WITH DIAERESIS 

LI30 

LATIN CAPITAL LETTER 1 WITH DOT ABOVE 

LIU 

LATIN CAPITAL LETTER 1 WITH GRAVE 

LI32 

LATIN CAPITAL LETTER 1 WITH MACRON 

LI44 

LATIN CAPITAL LETTER 1 WITH OGONEK 

LI20 

LATIN CAPITAL LETTER 1 WITH TILDE 

LJ02 

LATIN CAPITAL LETTER J 

LJ16 

LATIN CAPITAL LETTER J WITH CIRCUMFLEX 

LK02 

LATIN CAPITAL LETTER K 

LK42 

LATIN CAPITAL LETTER K WITH CEDILLA 

LL02 

LATIN CAPITAL LETTER L 

LL12 

LATIN CAPITAL LETTER L WITH ACUTE 


26 




Table B.1 - (continued) 


ID 

Name 

LL22 

LATIN CAPITAL LETTER L WITH CARON 

LL42 

LATIN CAPITAL LETTER L WITH CEDILLA 

LL64 

LATIN CAPITAL LETTER L WITH MIDDLE DOT 

LL62 

LATIN CAPITAL LETTER L WITH STROKE 

LM02 

LATIN CAPITAL LETTER M 

LN02 

LATIN CAPITAL LETTER N 

LN12 

LATIN CAPITAL LETTER N WITH ACUTE 

LN22 

LATIN CAPITAL LETTER N WITH CARON 

LN42 

LATIN CAPITAL LETTER N WITH CEDILLA 

LN20 

LATIN CAPITAL LETTER N WITH TILDE 

LO02 

LATIN CAPITAL LETTER 0 

L012 

LATIN CAPITAL LETTER 0 WITH ACUTE 

L016 

LATIN CAPITAL LETTER O WITH CIRCUMFLEX 

L01 8 

LATIN CAPITAL LETTER 0 WITH DIAERESIS 

L026 

LATIN CAPITAL LETTER 0 WITH DOUBLE ACUTE 

LOU 

LATIN CAPITAL LETTER 0 WITH GRAVE 

L032 

LATIN CAPITAL LETTER 0 WITH MACRON 

L062 

LATIN CAPITAL LETTER 0 WITH STROKE 

LO20 

LATIN CAPITAL LETTER 0 WITH TILDE 

LP02 

LATIN CAPITAL LETTER P 

LQ02 

LATIN CAPITAL LETTER Q 

LR02 

LATIN CAPITAL LETTER R 

LR12 

LATIN CAPITAL LETTER R WITH ACUTE 

LR22 

LATIN CAPITAL LETTER R WITH CARON 

LR42 

LATIN CAPITAL LETTER R WITH CEDILLA 

LS02 

LATIN CAPITAL LETTER S 

LSI 2 

LATIN CAPITAL LETTER S WITH ACUTE 

LS22 

LATIN CAPITAL LETTER S WITH CARON 

LS42 

LATIN CAPITAL LETTER S WITH CEDILLA 

LSI 6 

LATIN CAPITAL LETTER S WITH CIRCUMFLEX 

LT02 

LATIN CAPITAL LETTER T 

LT22 

LATIN CAPITAL LETTER T WITH CARON 

LT42 

LATIN CAPITAL LETTER T WITH CEDILLA 

LT62 

LATIN CAPITAL LETTER T WITH STROKE 

LT64 

LATIN CAPITAL LETTER THORN 

LU02 

LATIN CAPITAL LETTER U 

LU12 

LATIN CAPITAL LETTER U WITH ACUTE 

LU24 

LATIN CAPITAL LETTER U WITH BREVE 

LU16 

LATIN CAPITAL LETTER U WITH CIRCUMFLEX 

LU18 

LATIN CAPITAL LETTER U WITH DIAERESIS 

LU26 

LATIN CAPITAL LETTER U WITH DOUBLE ACUTE 

LU14 

LATIN CAPITAL LETTER U WITH GRAVE 

LU32 

LATIN CAPITAL LETTER U WITH MACRON 

LU44 

LATIN CAPITAL LETTER U WITH OGONEK 

LU28 

LATIN CAPITAL LETTER U WITH RING ABOVE 

LU20 

LATIN CAPITAL LETTER U WITH TILDE 

LV02 

LATIN CAPITAL LETTER V 

LW02 

LATIN CAPITAL LETTER W 




Table B.1 - (continued) 


“ID 

Name 

LW16 

LATIN CAPITAL LETTER W WITH CIRCUMFLEX 

LX02 

LATIN CAPITAL LETTER X 

LY02 

LATIN CAPITAL LETTER Y 

LY12 

LATIN CAPITAL LETTER Y WITH ACUTE 

LY16 

LATIN CAPITAL LETTER Y WITH CIRCUMFLEX 

LY18 

LATIN CAPITAL LETTER Y WITH DIAERESIS 

LZ02 

LATIN CAPITAL LETTER Z 

LZ12 

LATIN CAPITAL LETTER Z WITH ACUTE 

LZ22 

LATIN CAPITAL LETTER Z WITH CARON 

LZ30 

LATIN CAPITAL LETTER Z WITH DOT ABOVE 

LI64 

LATIN CAPITAL LIGATURE IJ 

L064 

LATIN CAPITAL LIGATURE OE 

LA01 

LATIN SMALL LETTER A 

LA1 1 

LATIN SMALL LETTER A WITH ACUTE 

LA23 

LATIN SMALL LETTER A WITH BREVE 

LAI 5 

LATIN SMALL LETTER A WITH CIRCUMFLEX 

LAI 7 

LATIN SMALL LETTER A WITH DIAERESIS 

LAI 3 

LATIN SMALL LETTER A WITH GRAVE 

LA31 

LATIN SMALL LETTER A WITH MACRON 

LA43 

LATIN SMALL LETTER A WITH OGONEK 

LA27 

LATIN SMALL LETTER A WITH RING ABOVE 

LAI 9 

LATIN SMALL LETTER A WITH TILDE 

LA61 

LATIN SMALL LETTER AE 

LB01 

LATIN SMALL LETTER B 

LC01 

LATIN SMALL LETTER C 

LC1 1 

LATIN SMALL LETTER C WITH ACUTE 

LC21 

LATIN SMALL LETTER C WITH CARON 

LC41 

LATIN SMALL LETTER C WITH CEDILLA 

LC15 

LATIN SMALL LETTER C WITH CIRCUMFLEX 

LC29 

LATIN SMALL LETTER C WITH DOT ABOVE 

LD01 

LATIN SMALL LETTER D 

LD21 

LATIN SMALL LETTER D WITH CARON 

LD61 

LATIN SMALL LETTER D WITH STROKE 

LI61 

LATIN SMALL LETTER DOTLESS 1 

LE01 

LATIN SMALL LETTER E 

LE1 1 

LATIN SMALL LETTER E WITH ACUTE 

LE21 

LATIN SMALL LETTER E WITH CARON 

LEI 5 

LATIN SMALL LETTER E WITH CIRCUMFLEX 

LEI 7 

LATIN SMALL LETTER E WITH DIAERESIS 

LE29 

LATIN SMALL LETTER E WITH DOT ABOVE 

LEI 3 

LATIN SMALL LETTER E WITH GRAVE 

LE31 

LATIN SMALL LETTER E WITH MACRON 

LE43 

LATIN SMALL LETTER E WITH OGONEK 

LN61 

LATIN SMALL LETTER ENG 

LD63 

LATIN SMALL LETTER ETH 

LF01 

LATIN SMALL LETTER F 

LG01 

LATIN SMALL LETTER G 

LG23 

LATIN SMALL LETTER G WITH BREVE 

LG42 

LATIN SMALL LETTER G WITH CEDILLA 
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Table B.1 - (continued) 


ID 

Name 

LG1 5 

LATIN SMALL LEI 1 ER G WITH CIRCUMFLEX 

LG29 

LATIN SMALL LETTER G WITH DOT ABOVE 

LH01 

LATIN SMALL LETTER H 

LH15 

LATIN SMALL LETTER H WITH CIRCUMFLEX 

LH61 

LATIN SMALL LETTER H WITH STROKE 

LI01 

LATIN SMALL LETTER 1 

LI1 1 

LATIN SMALL LETTER 1 WITH ACUTE 

L1 1 5 

LATIN SMALL LETTER 1 WITH CIRCUMFLEX 

L1 1 7 

LATIN SMALL LETTER 1 WITH DIAERESIS 

LI13 

LATIN SMALL LETTER 1 WITH GRAVE 

LI31 

LATIN SMALL LETTER 1 WITH MACRON 

LI43 

LATIN SMALL LETTER 1 WITH OGONEK 

L1 1 9 

LATIN SMALL LETTER 1 WITH TILDE 

LJ01 

LATIN SMALL LETTER J 

LJ15 

LATIN SMALL LETTER J WITH CIRCUMFLEX 

LK01 

LATIN SMALL LETTER K 

LK41 

LATIN SMALL LETTER K WITH CEDILLA 

LK61 

LATIN SMALL LETTER KRA 

LL01 

LATIN SMALL LETTER L 

LL1 1 

LATIN SMALL LETTER L WITH ACUTE 

LL21 

LATIN SMALL LETTER L WITH CARON 

LL41 

LATIN SMALL LETTER L WITH CEDILLA 

LL63 

LATIN SMALL LETTER L WITH MIDDLE DOT 

LL61 

LATIN SMALL LETTER L WITH STROKE 

LM01 

LATIN SMALL LETTER M 

LN01 

LATIN SMALL LETTER N 

LN63 

LATIN SMALL LETTER N PRECEDED BY APOSTROPHE 

LN1 1 

LATIN SMALL LETTER N WITH ACUTE 

LN21 

LATIN SMALL LETTER N WITH CARON 

LN41 

LATIN SMALL LETTER N WITH CEDILLA 

LN19 

LATIN SMALL LETTER N WITH TILDE 

LOOI 

LATIN SMALL LETTER O 

L01 1 

LATIN SMALL LETTER 0 WITH ACUTE 

L015 

LATIN SMALL LETTER O WITH CIRCUMFLEX 

L017 

LATIN SMALL LETTER 0 WITH DIAERESIS 

L025 

LATIN SMALL LETTER 0 WITH DOUBLE ACUTE 

L013 

LATIN SMALL LETTER 0 WITH GRAVE 

L031 

LATIN SMALL LETTER 0 WITH MACRON 

L061 

LATIN SMALL LETTER 0 WITH STROKE 

L019 

LATIN SMALL LETTER 0 WITH TILDE 

LP01 

LATIN SMALL LETTER P 

LQ01 

LATIN SMALL LETTER Q 

LR01 

LATIN SMALL LETTER R 

LR1 1 

LATIN SMALL LETTER R WITH ACUTE 

LR21 

LATIN SMALL LETTER R WITH CARON 

LR41 

LATIN SMALL LETTER R WITH CEDILLA 

LS01 

LATIN SMALL LETTER S 

LS1 1 

LATIN SMALL LETTER S WITH ACUTE 




Table B.1 - (continued) 


ID 

Name 

LS21 

LATIN SMALL LETTER S WITH CARON 

LS41 

LATIN SMALL LETTER S WITH CEDILLA 

LSI 5 

LATIN SMALL LETTER S WITH CIRCUMFLEX 

LS61 

LATIN SMALL LETTER SHARP S 

LT01 

LATIN SMALL LETTER T 

LT21 

LATIN SMALL LETTER T WITH CARON 

LT41 

LATIN SMALL LETTER T WITH CEDILLA 

LT61 

LATIN SMALL LETTER T WITH STROKE 

LT63 

LATIN SMALL LETTER THORN 

LU01 

LATIN SMALL LETTER U 

LU1 1 

LATIN SMALL LETTER U WITH ACUTE 

LU23 

LATIN SMALL LETTER U WITH BREVE 

LU15 

LATIN SMALL LETTER U WITH CIRCUMFLEX 

LU17 

LATIN SMALL LETTER U WITH DIAERESIS 

LU25 

LATIN SMALL LETTER U WITH DOUBLE ACUTE 

LU13 

LATIN SMALL LETTER U WITH GRAVE 

LU31 

LATIN SMALL LETTER U WITH MACRON 

LU43 

LATIN SMALL LETTER U WITH OGONEK 

LU27 

LATIN SMALL LETTER U WITH RING ABOVE 

LU19 

LATIN SMALL LETTER U WITH TILDE 

LV01 

LATIN SMALL LETTER V 

LW01 

LATIN SMALL LETTER W 

LW15 

LATIN SMALL LETTER W WITH CIRCUMFLEX 

LX01 

LATIN SMALL LETTER X 

LY01 

LATIN SMALL LETTER Y 

LY1 1 

LATIN SMALL LETTER Y WITH ACUTE 

LY15 

LATIN SMALL LETTER Y WITH CIRCUMFLEX 

LY17 

LATIN SMALL LETTER Y WITH DIAERESIS 

LZ01 

LATIN SMALL LETTER Z 

LZ1 1 

LATIN SMALL LETTER Z WITH ACUTE 

LZ21 

LATIN SMALL LETTER Z WITH CARON 

LZ29 

LATIN SMALL LETTER Z WITH DOT ABOVE 

LI63 

LATIN SMALL LIGATURE IJ 

L063 

LATIN SMALL LIGATURE OE 

SM1 1 

LEFT CURLY BRACKET 

SP21 

LEFT DOUBLE QUOTATION MARK 

SP06 

LEFT PARENTHESIS 

SP17 

LEFT-POINTING DOUBLE ANGLE QUOTATION MARK 

SP19 

LEFT SINGLE QUOTATION MARK 

SM06 

LEFT SQUARE BRACKET 

SM30 

LEFTWARDS ARROW 

SA03 

LESS-THAN SIGN 

SP09 

LOW LINE 

SD31 

MACRON 

SM20 

MASCULINE ORDINAL INDICATOR 

SMI 7 

MICRO SIGN 

SM26 

MIDDLE DOT 
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Table B.1 - (concluded) 


ID 

Name 

SA07 

MULT IPLICA I ION SIGN 

SP30 

NO-BREAK SPACE 

SM66 

NOT SIGN 

SM01 

NUMBER SIGN 

SD43 

OGONEK 

SM18 

OHM SIGN 

SM02 

PERCENT SIGN 

SM25 

PILCROW SIGN 

SA01 

PLUS SIGN 

SA02 

PLUS-MINUS SIGN 

SC02 

POUND SIGN 

SP15 

QUESTION MARK 

SP04 

QUOTATION MARK 

SM53 

REGISTERED SIGN 

SM07 

REVERSE SOLIDUS 

SM14 

RIGHT CURLY BRACKET 

SP22 

RIGHT DOUBLE QUOTATION MARK 

SP07 

RIGHT PARENTHESIS 

SP18 

RIGHT-POINTING DOUBLE ANGLE QUOTATION MARK 

SP20 

RIGHT SINGLE QUOTATION MARK 

SM08 

RIGHT SQUARE BRACKET 

SM31 

RIGHTWARDS ARROW 

SD27 

RING ABOVE 

SM24 

SECTION SIGN 

SP14 

SEMICOLON 

SP32 

SOFT HYPHEN 

SP12 

SOLIDUS 

SP01 

SPACE 

NS01 

SUPERSCRIPT ONE 

NS03 

SUPERSCRIPT THREE 

NS02 

SUPERSCRIPT TWO 

SD19 

TILDE 

SM54 

TRADE MARK SIGN 

SM32 

UPWARDS ARROW 

SM13 

VERTICAL LINE 

NF20 

VULGAR FRACTION FIVE EIGHTHS 

NF18 

VULGAR FRACTION ONE EIGHTH 

NF01 

VULGAR FRACTION ONE HALF 

NF04 

VULGAR FRACTION ONE QUARTER 

NF21 

VULGAR FRACTION SEVEN EIGHTHS 

NF19 

VULGAR FRACTION THREE EIGHTHS 

NF05 

VULGAR FRACTION THREE QUARTERS 

SC05 

YEN SIGN 




Annex C 

(informative) 


Use of non-spacing diacritical marks 


The supplementary set (see tables 1 and 3) contains 13 non-spacing diacritical marks which are used in 
combination with the letters of the basic Latin alphabet in the primary set, and with SPACE, to represent accented 
letters and diacritical marks as separate graphic characters. 

The combinations of non-spacing diacritical marks and basic letters which are defined in this International Standard 
are given in table C.1 which also gives ligatures and other special letters. 

NOTE 16: The term "non-spacing diacritical mark" is used in this International Standard in a metaphorical sense 
only. The "combination" of a non-spacing diacritical mark with a basic letter does not "generate" a new letter, but 
only indicates how a letter from the repertoire of this International Standard is to be coded. 

Table C.1 - Combinations of diacritical marks and basic letters 


BASIC 

LETTER 

acute 

grave 

circum 

flex 


tilde 

caron 

breve 

double 

acute 

ring 

above 

dot 

above 

macron 

cedilla 

ogonek 

ligature 

others 

aA 

aA 

aA 

aA 

aA 

aA 


aA 


aA 


aA 


& 


aa/E 

cC 

60 


cC 



cC 




cC 


SQ 




dD 






dD 









SdD 

eE 

eE 

eE 

eE 

eE 


eE 




eE 

eE 





gG 



gG 




gG 



gG 


n 




hH 



hH 












hH 

il 

m 

m 

mm 

m 

m 





m 



EH 


1 

U 



jj 













kK 












kK 



K 

IL 

I 





IL 









tLIL 

nN 

nN 




nN 

nN 






ijty 



hr)f] 

oO 

60 

60 

60 

60 

60 



60 



60 



oeCE 

00 

rR 

rR 





rR 






t? 




sS 

sS 


sS 



sS 






?§ 



(3 

tT 






if 






il 



tTpP 

uU 

uU 

uU 

uG 

uG 

uG 


uG 

uG 

uU 


uG 


yG 



wW 



wW 













yY 

yY 


yY 

yY 












zZ 

zZ 





zZ 




zZ 






(SP) 



A 


- 

v 

w 

" 




> 

C 
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Annex D 

(informative) 

Use of Latin alphabetic characters in various languages 


Table D.1 summarizes the use of the Latin alphabetic characters defined in this International Standard in 41 
different languages (39 European languages, Afrikaans and Esperanto). 

The 26 basic letters of the Latin alphabet have not been included in the table because they are considered 
indispensable in all languages, even though several languages do not require letters such as q or w for their own 
orthographies. 

Table D.1 is intended to provide justification for the composition of the alphabetic part of the graphic character 
repertoire. It does not attempt to define which characters should, and which ones should not, be used in any 
language. 

NOTE 16 Usage within any country or areas is to some extent dependent on the text, its intended use and 
its form of presentation. Furthermore, it is common in many languages to include loan words" taken from other 
languages. The requirements for these specialties have not been shown in this table except where such loan words 
have such long-standing or widespread use that they are now considered to be "naturalized" rather than "foreign" 
words in a particular language. 

NOTE 17 See note 12 page 7. 

NOTE 18 As a result of a spelling reform of Greenlandic in 1973, the following characters are depreciated, 
but still used in personal names: 

LATIN CAPITAL LETTER I WITH TILDE 
LATIN SMALL LETTER I WITH TILDE 
LATIN SMALL LETTER KRA 
LATIN CAPITAL LETTER U WITH TILDE 
LATIN SMALL LETTER U WITH TILDE 

NOTE 19 For spelling the Welsh language correctly, some more letters are in fact required. They are not 
included in the repertoire, but are only identified here: 

LATIN CAPITAL LETTER W WITH ACUTE 
LATIN SMALL LETTER W WITH ACUTE 
LATIN CAPITAL LETTER W WITH GRAVE 
LATIN SMALL LETTER W WITH GRAVE 
LATIN CAPITAL LETTER W WITH DIAERESIS 
LATIN SMALL LETTER W WITH DIAERESIS 
LATIN CAPITAL LETTER Y WITH GRAVE 
LATIN SMALL LETTER Y WITH GRAVE 
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Character 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LL 


AAAAA 

AAAAA 

CCCCC 

DDDEE 

EEEEE 

EGGGG 

HH 


11111 

22346 

11224 

26611 

11223 

41224 

16 


13579 

1 1 1 1 1 

37131 

Mill 

15191 

1 1 1 1 1 

11313 

1 1 1 1 1 

57191 

1 1 1 1 1 

35391 

1 1 1 1 1 

51 

1 | 


11112 

22346 

11234 

i m m 
26611 

1 1 1 1 1 
11233 

1 1 1 1 1 
41234 

16 


24680 

48242 

26202 

22224 

68202 

46402 

62 


aaaaS 

aaa^ae 

cficCg 

dddee 

eeeee 

eggg£ 

fill 

Languages 

AAAAA 

aAa^e 

ccccg 

DDDEE 

EEEEE 

?GGG? 

hH 

Afrikaans 

X X 



XX 

XX 



Albanian 



X 


X 



Basque 








Breton 





X 



Catalan 

XX 


X 

XX 




Croat 



X X 

X 




Czech 

X 


X 

X X 

X 



Danish 

X 

X X 


X 




Dutch 

XX X 



X 

X 



English 




XX 

XX 



Esperanto 



X 



X 

X 

Estonian 

X 







Faroese 

X 

X 


X 




Finnish 

X 







French 

XX 

X 

X 

XX 

XX 



Frisian 

XX 



X 

XX 



Galician 

X 



X 




German 

X 







Greenlandic 

XXX 

X X 


X 




Hungarian 

X 



X 




Icelandic 

X 

X 


XX 




Irish 

X 



X 




Italian 

X 



X 




Lapp ( Sami ) 

X XX 

X X 

X 

X XX 

X 



Latvian 


X 

X 


X 

X 


Lithuanian 


X 

X 

X 

X 

X 


Maltese 

X 


X 

X 


X 

X 

Norwegian 


X X 


X 




Occitan 

XX 


X 

XX 

X 



Polish 


X 

X 



X 


Portuguese 

XXX X 



X 

X 



Rhaeto-Romanic 

XX 



XX 

X 



Romanian 

X 

X 






(Scots) Gaelic 

XX 



XX 




Slovak 

X X 



X X 




Slovene 



X 





Sorbian 



X X 


X 



Spanish 

X 







Swedish 

X X 

X 


X 




Turkish 

X 


X 

X 

X 

X 


Welsh 

xxxx 



XX 

XX 































































Character 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LLLLL 

LL 


mu 

mu 

JKKLL 

LLLNN 

NNNNO 

00000 

00 


11111 

33466 

14612 

46611 

24661 

11112 

36 


13579 

1 1 1 1 1 

01313 

51111 

11319 

11131 

35795 

11 


11112 

34 6 

14 12 

1=6612 

246 1 

11122 

36 


24680 

24 4 

62 22 

22420 

222 2 

46806 

22 


iiiii 

iiiiij 

] Jpcll 

111 bn 

nijl] ho 

6565 6 

O0 

Languages 

fitii 

II D 

J£ LL 

btLlM 

Nltf] 6 

66000 

00 

Afrikaans 

XX 




X 

XX 


Albanian 








Basque 




X 




Breton 




X 




Catalan 

X X 



X 

X 

X 


Croat 








Czech 

X 




X X 



Danish 

X 




X 


X 

Dutch 

X X 

X 



X 

X 


English 








Esperanto 



X 





Estonian 





X 

XX 


Faroese 

X 






X 

Finnish 






X 


French 

XX 





X 


Frisian 

X 





XX 


Galician 

X 



X 

X 



German 






X 


Greenlandic 

xxx 


X 



X 

X 

Hungarian 

X 




X 

X X 


Icelandic 

X 




X 

X 


Irish 

X 




X 



Italian 

XX X 




X 

X 


Lapp ( Sami ) 

X 



X 

X 

X 

X 

Latvian 


X 

X 

X 

X 


X 

Lithuanian 


X 






Maltese 

XX 





X 


Norwegian 





X 

X 

X 

Occitan 

X X 




X 

X 


Polish 




X X 

X 



Portuguese 

X 




X 

X X 


Rhaeto-Romanic 

X 





xxx 


Romanian 

X 







(Scots) Gaelic 

X 




X 

X 


Slovak 

X 


XX 


X X 

X 


Slovene 








Sorbian 




X X 

X 



Spanish 

XX 



X 

X 



Swedish 






X 


Turkish 

X 

X X 




X 



Welsh 


xxxx 


x 


xxx 






























































Character 

LLLLL 

LLLLL 

LLLLL 

LLLLL 


ORRRS 

SSSST 

TTTUU 

uuuuu 


61241 

12462 

46611 

11122 


31111 

1 1 1 1 1 

51111 

III 1 

11313 

1 1 1 1 1 

57935 

1 1 1 1 1 


61241 

124 2 

1 1 

46611 

11222 


42222 

622 2 

22424 

68046 


cerf^rs 

§s?St 

tt£>uu 

UUUUU 

Languages 

CERRRS 

SS$ T 

tTpuu 

uuuuu 

Afrikaans 




X 

Albanian 





Basque 




X 

Breton 



X 

X 

Catalan 



X 

X 

Croat 


X 



Czech 

X 

X X 

X 


Danish 



X 

X 

Dutch 





English 





Esperanto 


X 


X 

Estonian 


X 


X 

Faroese 



X 


Finnish 





French 

X 


X 

XX 

Frisian 



X 

XX 

Galician 



X 

X 

German 


X 


X 

Greenlandic 



X 

X X 

Hungarian 



X 

X X 

Icelandic 



XX 


Irish 



X 


Italian 



XX 


Lapp ( Sami ) 


X 

X 


Latvian 

X 

X 


X 

Lithuanian 


X 



Maltese 



X 


Norwegian 





Occitan 



X 

X 

Polish 

X 




Portuguese 



X 

X 

Rhae to -Romanic 




X 

Romanian 


X 

X 


(Scots) Gaelic 



X 


Slovak 

X 

X X 



Slovene 


X 



Sorbian 

X X 

X 



Spanish 



X 

X 

Swedish 




X 

Turkish 


X 


XX 

Welsh 



XX 

XX 


LLLLL 

LLLLL 

UUUWY 

YYZZZ 

23411 

11122 

71351 

1 1 1 1 1 

57119 

1 II 1 1 

III 

23411 

11123 

82462 

68220 

duy^ry 

■tyyzzi 

UUUWY 

YYZZZ 




























































Annex E 

(informative) 

Alternative coded representation of the repertoire 
with no non-spacing diacritical marks 


The character repertoire of this International Standard can also be represented in an alternative coding which does 
not require the use of the non-spacing diacritical marks. 

This coded representation is a version of ISO/I EC 4873 Level 2 or 3 that uses the following graphic character sets 
from ISO/I EC 10367: 

- the Basic GO set (ISO-IR 6), 

- Latin alphabet No 1 supplementary set (ISO-IR 100) or Latin alphabet No 5 supplementary set (ISO-IR 148), 

- Latin alphabet No 2 supplementary set (ISO-IR 101), 

- Supplementary set for Latin alphabets No 1 or 5, and 2 (ISO-IR 154). 
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Annex F 

(informative) 
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Annex G 

(informative) 

Main differences between the 1994 (second) edition of ISO/IEC 6937, and 
the present (third) edition of this International Standard 


1 Annex G of the second edition was replaced with a new text. 

2 The names of LATIN SMALL and CAPITAL LETTER AE were changed from the 1994 
edition (where they were called LIGATURE), to align with ISO/IEC 10646-1. 

3 For the same reason, the name MUSIC NOTE was changed to EIGHTH NOTE, and 
TRADEMARK SIGN was changed to TRADE MARK SIGN. 

4 The following short identifiers were changed (see annex B, NOTE 15): 

old new 

LA51 LA61 
LA52 LA62 
LG 11 LG41 
LI51 LI63 
LI52 LI64 
L051 L063 
L052 L064 
SM95 SM65 
SM96 SM66 

5 A number of small corrections and clarifications was applied. 
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